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Abstract 

We consider a family of random trees satisfying a Markov branching property. Roughly, this 
property says that the subtrees above some given height are independent with a law that depends 
only on their total size, the latter being either the number of leaves or vertices. Such families 
are parameterized by sequences of distributions on partitions of the integers, that determine 
how the size of a tree is distributed in its different subtrees. Under some natural assumption 
^Sj on these distributions, stipulating that "macroscopic" splitting events are rare, we show that 

^ Markov branching trees admit the so-called self-similar fragmentation trees as scaling limits in 

^ the Gromov-Hausdorff-Prokhorov topology. 

The main application of these results is that the scaling limit of random uniform unordered 
^NJ trees is the Brownian continuum random tree. This extends a result by Marckert-Miermont and 

fully proves a conjecture by Aldous. We also recover, and occasionally extend, results on scaling 
limits of consistent Markov branching model, and known convergence results of Galton-Watson 
trees towards the Brownian and stable continuum random trees. 



^ 1 Introduction and main results 
g 

i_i The goal of this paper is to discuss the scaling hmits of a model of random trees satisfying a simple 

Markovian branching property, that was considered in [26], and considered in different forms in a 



^ number of places ^15t d2\- Markov branching trees are natural models of random trees, defined 

CN in terms of discrete fragmentation processes. The laws of these trees are indexed by an integer 

n giving the "size" of the tree, which leads us to consider two distinct (but related) models, in 
^T) which the sizes are respectively the number of leaves and the number of vertices. We first provide 

a slightly informal description of our results. 
<0 Let q = {qn,n > 1) be a family of probability distributions, respectively on the set Vn of 

partitions of the integer n, i.e. of non-increasing integer sequences with sum n. We assume that g„ 
• • does not assign mass 1 to the trivial partition (n): 

• ^ 

^ <ln{{n)) < 1 , for every n > 1 . 



In order that this makes sense for n = 1, we add an extra "empty partition" to "Pi. 

One constructs a random rooted tree with n leaves according to the following procedure. Start 
from a collection of n indistinguishable balls, and with probability . . . , Xp), split the collection 

into p sub-collections with Ai, . . . , Ap balls. Note that there is a chance g„((n)) < 1 that the collec- 
tion remains unchanged during this step of the procedure. Then, re-iterate the splitting operation 
independently for each sub-collection using this time the probability distributions qx^, ■ ■ ■ , qx^ ■ If 
a sub-collection consists of a single ball, it can remain single with probability gi((l)) or get wiped 
out with probability gi(0). We continue the procedure until all the balls are wiped out. There is 
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Figure 1: A sample tree Tn. The first splitting arises with probability gii(4, 4, 3). 



a natural genealogy associated with this process, which is a tree with n leaves consisting in the n 
isolated balls just before they are wiped out, and rooted at the initial collection of n balls. See 
Figure [l] for an illustration. We let be the law of this tree. 

This construction can be seen as the most general form of splitting trees of Broutin &: al. [T2], 
and was referred to as trees bearing the so-called Markov branching property in [26j. There is also a 
variant of this procedure that constructs a random tree with n vertices rather than n leaves. This 
one does not need the hypothesis qn{{n)) < 1 for n > 1, and in fact we only assume gi((l)) = 1 for 
consistency of the description to follow. Informally, starting from a collection of n balls, we first 
remove a ball, split the n — 1 remaining balls in sub-collections with Ai, . . . , Ap balls with probability 
Qn~i{{M, ■ ■ ■ , ^p)), and iterate independently on sub-collections until no ball remains. We let Qn 
be the law of the random tree associated to this procedure. 

While most papers so far have been focusing on families of trees having more structure, such as 
a consistency property when n varies |3t 126^ [T5] (with the notable exception of Broutin & al. |12j). 
the main goal of the present work is to study the structure of trees with laws Pn or as n — )• cxd 
in a very general situation. The main assumption that we make is that, as n — )• oo, 

macroscopic splitting events of the form n — )• {nsi,ns2, . . .) E Vn for a non-increasing 
sequence s = (si,S2, . . .) with sum 1 and such that si < 1 — e, for some e G (0, 1), 
are rare events, occurring with probability of order n~^i's{ds) for some 7 > 0, for some 
finite "intensity" measure v^- 

Note that the measures should satisfy a consistency property as e varies, and as e goes to 0, 
z^e should increase to a possibly infinite measure on the set of non-increasing sequences with sum 
1. This means that splitting events that only remove tiny parts from a large collection of balls 
are allowed to remain more frequent than the order n~'^ . Under this assumption, formalized in 
hypothesis (H) below, we show in Theorem [l] that a tree Tn with law P^, considered as a metric 
space by viewing its edges as being real segments of lengths of order n~'^, converges in distribution 
towards a limiting structure 7^,z/, the so-called self-similar fragmentation tree of [24j : 
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When 7 G (0, 1), a similar result (Theorem [2]) holds when T„ has distribution Qn 
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The limiting tree T-y^i, can be seen as the genealogical tree of a continuous model for mass 
splitting, in some sense analogous to the Markov branching property described above. The above 
convergence holds in distribution in a space of measured metric spaces, endowed with the so-called 
Gromov-Hausdorff-Prokhorov topology. This result contrasts with the situation of [12] , where it is 
assumed that macroscopic splitting events occur at every step of the construction. In that case, the 
height of Tn is of order log n, and no interesting scaling limit exists. A key step in our study will be 
to use the results from [25], where scaling limits of non-increasing Markov chains were considered: 
such Markov chains are indeed obtained by considering the successive sizes of collections containing 
a particular marked ball when going up in the tree r„. 

This general statement allows to recover, and sometimes improve, many results of |26 1l27l[33l ll4j 
dealing specifically with Markov branching trees. It also applies to models of random trees that 
are not a priori directly connected to our study. In particular, we recover the results of Aldous [3] 
and Duquesne [T7] showing that the so-called Brownian and stable trees [U EHl [lEl US] are universal 
limits for conditioned Galton- Watson trees. 

More notably, our results allow to prove that uniform unordered trees with n vertices and 
degrees bounded by some integer m + 1 £ [3, oo] admit the Brownian continuum random tree as a 
scaling limit. This was conjectured by Aldous [2] and proved in [2^ in the particular case m = 2 of 
a binary branching, using completely different methods from ours. The difficulty of handling such 
families of random trees comes from the fact that they have no 'nice' probabilistic representations, 
using for instance branching processes or growth models. As a matter of fact, uniform random 
unordered trees do not even have the Markov branching property, but turn out to 'almost' bear 
this property, in a sense that will be explained below. 

The rest of this section is devoted to a detailed formalization of our results. Throughout the 
paper, we use the notations 

N = {1,2,3, ...}, Z+ = {0}UN, [n] = {1,2, ...,n}, nGN. 

The random variables appearing in this paper are either canonical or defined on some probability 
space (0,-F,P). 

Acknowledgment. We are deeply indebted to Jean-Frangois Marckert, who suggested that the 
methods of [26j could be relevant to tackle the problem of scaling limits of uniform unordered trees. 
This provided the initial spark for the present work and |25j . 

1.1 Discrete trees 

We briefly introduce some formalism for trees. Set N*^ = {0}, and let 

= J N" . 

n>0 

For u = {ui, . . . ,Un) G we denote by \u\ the length of n, also called the height of u. If u = 
(ui, . . . , Un) with n > 1, we let pr(n) = (ui, . . . , Un-i), and for z > 1, we let ui = (ui, . . . , Un,i)- 
More generally, for u = (ui, . . . , u„) and v = {vi, . . . , Vm) in U, we let uv = {ui, . . . , n„, vi, . . . , Vm) 
be their concatenation, and for A C U and u £ U, we let uA = {uv : v G A}, and simply let 
iA = {i)A for i G N. We say that u is a prefix of v if v £ uU, and write u ^ v, defining a partial 
order on U. 

A plane tree is a non-empty, finite subset t dli (whose elements are called vertices) , such that 

• If u G t with \u\ > 1, then pr(u) G t, 

• If n G t, then there exists a number Cu{t) G (the number of children of u) such that ui £ t 
if and only if 1 < i < Cu(t). 
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Let dt = {u & t : Cu{t) = 0} be the set of leaves of t. If t^^\ . . . , t^*^) are plane trees, we can define 
a new plane tree by 

k 

(tW,...,t('=)) = {0}u(Jit». 

1=1 

A plane tree has a natural graphical representation, in which every n G t is a vertex, joined to 
its Cu{t) children by as many edges. But t carries more information than the graph, as it has a 
natural ordered structure. In this work, wc will not be interested in this order, and we present one 
way to get rid of this unwanted structure. Let t be a plane tree, and cr = (cTu, ti G t) be a sequence 
of permutations, respectively au G &cu{t)- For u = {ui, . . . , Un) £ t, let 

(r{u) = (cr0(ui),(7(„j)(M2),O-(«i,«2)(^3),---,O-(«l,...,«„_l)('"n)) ' 

and <t(0) = 0. Then the set at = {c(n) : u € t} is a plane tree, obtained intuitively by shuffling 
the set of children of u in t according to the permutation cj„. We say that t,t' are equivalent if 
there exists some a such that f(t) = t'. Equivalence classes of plane trees will be called (rooted) 
unordered trees, or simply trees as opposed to plane trees, and denoted by lowercase letters t. 

When dealing with a tree t, we will freely adapt some notations from plane trees when dealing 
with quantities that do not depend on particular plane representatives. For instance, #t, ^dt will 
denote the number of vertices and leaves of t, while 0, C0(t) will denote the root of t and its degree. 

We let T be the set of trees, and for n > 1, 

= {t G T : #5t = n} , T„ = {t G T : #t = n} 

be the set of trees with n leaves, resp. n vertices. The class of {0} is the vertex tree • G Ti = T^^. 

Heuristically, the information carried in a tree is its graph structure, with a distinguished "root" 
vertex corresponding to 0, and considered up to root-preserving graph isomorphisms — it is not 
embedded in any space, and its vertices are unlabeled. 

It is a simple exercise to see that if t^*) , 1 < i < A; are trees, and t^*-* is a choice of a plane 
representative of t*-*) for each i, then the class of (t^*-*, 1 < i < k) does not depend on the particular 
choice for t^^\ We denote this common class by (t(*),l < i < k). Note that j{t) := (t) can be 
seen as the tree t whose root has been attached to a new root by an edge, and f{t), for Z > 0, is 
tree t whose root has been attached to a new root by a string of / edges. For instance, is the 

line-tree consisting of a string with length I, rooted at one of its ends. Finally, for trees t^^\ . . . , t^*) 
and I > 1 we let 

(tW,...,t(^)),=/((tW,...,tW)), 
so fi*) = with these notations. 

1.2 Markov branching trees 

A partition of an integer n > 1 is a sequence of integers A = (Ai, . . . , Xp) with Ai > . . . > Ap > 1 
and Ai -|- . . . -|- Ap = n. The number p = p{X) is called the number of parts of the partition A, and 
the partition is called non-trivial if p{X) > 2. We let Vn be the set of partitions of the integer n. 
We also add an extra element to "Pi, so that Vi = {(1), 0}. 
For X eVn and 1 < j < n, we define 

m,(A) = #{iG{l,2,...,p(A)}:Ai=j}, 

the multiplicity of parts of A equal to j. 

By convention, it is sometimes convenient to set Aj = for i > p{X), and to identify the sequence 
A with the infinite sequence (Aj, i > 1). Such identifications will be implicit when needed. 
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1.2.1 Markov branching trees with a prescribed number of leaves 



In this paragraph, the size of a tree t G T is going to be the number #5t of its leaves. 
Let q = {qn,n > 1) be a sequence of probability distributions respectively on Vn, 

9n = (9n(A), A e Pn) , ^ g„(A) = 1 , 

such that 

(?„((n))<l, n>l. (1) 
Consider a family of probability distributions Pn,?^ > 1 on respectively, such that 

1. is the law of the line-tree (•)g) where G has a geometric distribution given by 

P(G = A;)=gl(0)(l-gl(0))^ A;>0, 

2. for n >2, Pn is the law of 

(r«,l<i<p(A)), 

where A has distribution q^, and conditionally on the latter, the trees T^^\ 1 < i < Pi-^) 
independent with distributions P^. respectively. 

Alternatively, for n > 2, Pn is the law of (T^*^ 1 < i < p{A))g where G is independent of A and 
geometric with 

P(G = A:) = (l-(?„((n)))g„((n))^ A; > 1 , 

and conditionally on A with law qn{- \ Vn \ {(n)}), the trees T^^\ . . . , T^^^^^^ are independent with 
distributions Pa, respectively. A simple induction argument shows that there exists a unique family 
P^ satisfying the properties 1 and 2 above. 

A family of random trees T„,n > 1 with respective distributions Pnjn > 1 is called a Markov 
branching family. The law of the tree T„ introduced in the beginning of the Introduction to describe 
the genealogy of splitting collections of n balls is P^. 



1.2.2 Markov branching trees with a prescribed number of vertices 

We now consider the following variant of the above construction, in which the size of a tree t is 
the number of its vertices. For every n > 1, let again g„ be a probability distribution on Vn- We 
do not assume ([T]), rather, we make the sole assumption that gi((l)) = 1. For every n > 1, we 
construct inductively a family of random trees Tn respectively in the set of trees with n vertices, 
by assuming that for A = (Ai, . . . , Ap) S Vn-i, with probability g„_i(A), the n — 1 vertices distinct 
from the root vertex are dispatched in p subtrees with Ai > . . . > Ap vertices, and that, given these 
sizes, the p subtrees are independent with same distribution as Ta^, . . . ,Tx^ respectively. 
Formally, 

1. let Ql be the law of •, and 

2. for n > 1, let Q^_|_i be the law of 

(T«,l<i<p(A)), 

where A has distribution qn, and conditionally on the latter, the trees T^*\ 1 < i < p{-^) &re 
independent with distributions Q^, respectively. 

By induction, these two properties determine the laws Qn,n > 1 uniquely. 

The construction is very similar to the previous one, and can in fact be seen as a special case. 



after a simple transformation on the tree; see Section 4.5 below. 
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1.3 Topologies on metric spaces 

The main goal of the present work is to study scaling limits of trees with distributions Pn, Qn, as n 
becomes large. For this purpose, we need to consider a topological "space of trees" in which such 
limits can be taken, and define the limiting objects. 

A rooiecj^ metric space is a triple {X,d,p), where {X,d) is a metric space and p £ X is a, 
distinguished point, called the root. We say that two rooted spaces {X, p, d), {X' , p' , d') are isometry- 
equivalent if there exists a bijective isometry from X onto X that sends p to p' . 

A measured, rooted metric space is a 4-tuple {X,d, p, p), where {X,d,p) is a rooted metric 
space and is a Borel probability measure on X. Two measured, rooted spaces {X, d, p, p) and 
{X, d' , p' , p') are isometry-equivalent if there exists a root-preserving, bijective isometry <j) from 
{X,d,p) to {X,d',p') such that the push-forward of /U by is p' . In the sequel we will almost 
always identify two isometry-equivalent (rooted, measured) spaces, and will often use the shorthand 
notation X for the isometry class of a rooted space or a measured, rooted space, in a way that 
should be clear from the context. Also, if X is such a space and a > 0, then we denote by aX the 
space in which the distance function is multiplied by a. 

We denote by ^ the set of equivalence classes of compact rooted spaces, and by the set 
of equivalence classes of compact measured spaces. 

It is well-known (this is an easy extension of the results of f2T|) that ^ is a Polish space 
when endowed with the so-called pointed Gromov-Hausdorff distance dQ}i, where by definition the 
distance dcHiiX, p), {X' , p')) is equal to the infimum of the quantities 

dist((/.(/5), 0'(p')) V distH(0(X), 0'(X')) 

where (j)^ (j)' are isometries from X, X' into a common metric space (M, dist), and where distn is the 
Hausdorff distance between compact subsets of (M, dist). It is elementary that this distance does 
not depend on particular choices in the equivalence classes of (X, p) and {X' , p'). We endow ^ with 
the associated Borel cr-algebra. Of course, (icH satisfies a homogeneity property, dQ,Y{[aX,aX') = 
adGY{{X,X') for a > 0. 

We also need to define a distance on that is in some sense compatible with the Gromov- 
Hausdorff distance. Several complete distances can be constructed, and we use a variation of the 
Gromov-Hausdorff-Prokhorov distance used in |30j . The induced topology is the same as that 
introduced earlier in [22j. The reader should bear in mind that the topology used in the present 
paper involves a little extension of the two previous references, since we are interested in pointed 
spaces. We let dGHp((-^5 p, p), {X' , d' , p' , p')) be the infimum of the quantities 

dist(0(p), cp'ip')) V distH(0(X), cP'iX')) V distp(,/.*/i, M , 

where again (p, (j)' are isometries from X, X' into a common space (M, dist), 0*/^, (p'^^p' are the push- 
forward of p, p' by 4>, (j)', and distp is the Prokhorov distance between Borel probability measures 
on M [201 Chapter 3]: 

distp (m, m) = inf{e > : m{C) < rn{C^) + e for every C C M closed} , 

where C"^ = {x G M : infj^gc" dist(x, y) < e} is the e-thickening of C. A simple adaptation of 
the results of [22] and Section 6 in [30| (in order to take into account the particular role of the 
distinguished point p) shows that: 

Proposition 1. The junction (iGHP o, distance on ^„ that makes it complete and separable. 

This distance is called the pointed Gromov-Hausdorff-Prokhorov distance. One must be careful 
that contrary to dcu, this distance is not homogeneous: dQup^aX, aX') is in general different from 
adQiip{X, X'), because only the distances, not the measures, are multiplied in aX,aX'. 

^usually such spaces are rather called pointed, but we prefer the term rooted which is more common when dealing 
with trees 
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1.3.1 Trees viewed as metric spaces 



A plane tree t can be naturally seen as a metric space by endowing t with the graph distance 
between vertices. Namely, 



where u Av is the longest prefix common to u, v. This coincides with the number of edges on the 
only simple path going from u to v. The space (t,dgr) is naturally rooted at 0. We can put two 
natural probability measures on t, the uniform measures on the leaves or on the vertices: 



If t € T is a tree and t,t' are two plane representatives of t, then it is elementary that the 
spaces (t, dgr, 0, /iat) and {t' , dgr , , Hdt') are isometry-equivalent rooted measured metric spaces. 
The same holds with /xt)/^t' instead of Hdt^t^dt'- We denote by (t, dgr, //gt) and (t, dgr, /o, /xt) the 
corresponding elements of Conversely, it is possible to recover uniquely the discrete tree (not 
a plane tree!) from the element of thus defined. 

1.3.2 M-trees 

An M-tree is a metric space {X, d) such that for every x,y £ X, 

1. there is an isometry ipx^y : [0,d{x,y)] — X such that ifx,y{0) = x and Lpx^y{d{x,y)) = y, and 

2. for every continuous, injective function c : [0, 1] ^ X with c(0) = x, c(l) = y, one has 



In other words, any two points in X are linked by a geodesic path, which is the only simple path 
linking these points, up to reparameterisation. This is a continuous analogue of the graph-theoretic 
definition of a tree as a connected graph with no cycle. We denote by the range of ipx,y 

We let ^ (resp. be the set of isometry classes of compact rooted M-trees (resp. compact, 
rooted measured M-trees). An important property is the following (these are easy variations on 
results by ED El). 

Proposition 2. The spaces and 5^ are closed subspaces of den) and (^„,(iGHp)- 

If T G and for x £ T, we call the quantity d{p, x) the height of x. If x, y G T, we say that 
X is an ancestor of y whenever x £ [[p, y]]. We let x A y G T be the unique element of T such that 
[[p, x]] n [[p, y]] = [[p, X Ay]], and call it the highest common ancestor of x and y in T. For x £ T, 
we denote by Tx the set of y £ T such that x is an ancestor of y. The set 7^, endowed with the 
restriction of the distance d, and rooted at x, is in turn a rooted M-tree, called the subtree of T 
rooted at x. If {T,d,p,p) is an element of and p(Tx) > 0, then Tx can be seen as an element 
of by endowing it with the measure /x(- n Tx)/ p-{Tx)- 

We say that x £ T, x ^ p, m. a, rooted M-tree is a leaf if its removal does not disconnect T. 
Note that this always exclude the root from the set of leaves, which we denote by C{T)- A branch 
point is an element of T of the form x Ay where x is not an ancestor of y nor vice- versa. It is also 
characterized by the fact that the removal of a branch point disconnects the M-tree into three or 
more components (two or more for the root). We let B{T) be the set of branch points of T- 



dgr{u, v) = \u\ + \v 



-2\u Av 



u,v £ t 




c{[0,l]) = ipx,y{[0,d{x,y)]). 



7 



1.4 Self-similar fragmentations and associated M-trees 

Self-similar fragmentation processes are continuous-time processes that describe the dislocation of 
a massive object as time passes. Introduce the set of partitions of a unit mass 

:= {s = {si,S2, . . .) : si > S2 > . . . > 0,J2si < ly 

i>l 

This space is endowed with the £^-metric d{s, s') = X]j>i ~ ■^ili which makes it a compact space. 

Definition 1. A self- similar fragmentation is an S^-valued Markov process (X-{t),t > 0) which 
is continuous in probability and satisfies the following fragmentation property. For some a G M, 
called the self- similarity index, it holds that conditionally given X(t) = (si,S2, ■ ■ ■), the process 
(X(t -|- t'),t' > 0) has same distribution as the process whose value at time t' is the decreasing 
rearrangement of the sequences SiX*^*)(s"t'), i > 1, where (X*^*),i > 1) are i.i.d. copies of^. 

Bertoin [8] and Berestycki [6] have shown that the laws of self-similar fragmentation processes are 
characterized by three parameters: the index a, a non-negative erosion coefficient, and a dislocation 
measure u on S'^. The idea is that every sub-object of the initial object, with mass x say, will 
suddenly split into sub-sub-objects of masses xsi, XS2, ■ ■ ■ at rate x°z/(ds), independently of the 
other sub-objects. Erosion accounts for the formation of zero-mass particles that are continuously 
ripped off the fragments. 

For our concerns, we will consider only the special case where the erosion phenomenon has no 
role and the dislocation measure does not charge the set {s E ^'l' : Si < 1}. One says that u is 
conservative. This motivates the following definition. 

Definition 2. A dislocation measure is a a-finite measure v on such that 0, 0, . . .)}) = 

and 

z^d^Si < l}) =0, I (1 - si)i/(ds) < oo . (2) 
i>i -^^^ 

We say that the measure is binary when + S2 < 1}) = 0. ^ binary measure is characterized 

by its image j^(si G dx) through the mapping s i— )• si. 

A fragmentation pair is a pair (a, v) where a GM is called the self- similarity index, and v is a 
dislocation measure. 

Fragmentation pairs (a, i') therefore characterize the distributions of the self-similar fragmen- 
tations we are focusing on. When o = —7 < 0, small fragments tend to split faster, and it turns 
out that they all disappear in finite time, a property known as formation of dust. Using this prop- 
erty, it is shown in [23| how to construct a "continuum random tree" encoding the process with 
characteristic pair (—7, i'). This tree is a random element of 

T/,u = (T, d, p, ^) , 

such that (T, d) is a random M-tree, and such that almost-surely 

1. the measure /x is supported on the set C{T) of leaves of T 

2. p. has no atom, 

3. for every x £ T \ C{T), it holds that /u(7^) > 0. 

Proposition 3 (Theorem 1 and Proposition 1 in |24j). The law ofTy^u is characterized by properties 
1, 2, 3 above, together with the fact that if t > and Ti{t),i > 1 are the connected components of 
the open set {x G T : d{p,x) > t}, then the process {{iJ,{Ti{t)) , i > l)^,t > 0) of the non-increasing 
rearrangement of the p-masses of these components has the same law as the -valued self-similar 
fragmentation with characteristic pair (—7,2^). 
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The previous proposition is not really constructive, and we postpone a more detailed description 
of T-y^u to Section 3.2 

It was shown in 1241 that one can recover the celebrated Brownian and stable continuum random 



trees [T| 128^ [T8] as special instances of fragmentation trees. The parameters 7 and u corresponding 



to these trees will be recalled when we discuss applications in Sections 2.1 and 2.2 



1.5 Main results 

Let (g„(A),A G Vn),n > 1 satisfy ([l]). With it, we associate a finite non-negative measure on 
S^, defined by its integral against measurable f : ^ M+ as 

Uf)= E 1n{X)f(^ 

Note that in the left-hand side, we have identified A/n with an element of S^., in accordance with 
our convention that A is identified with the infinite sequence (Aj,i > 1). We make the following 
basic assumption. 

(H) There exists a fragmentation pair (—7, v\ with 7 > 0, and a function I : (0, 00) — ?■ 
(0, 00) slowly varying at 00, such that we have the weak convergence of finite non- 
negative measures on S^: 

n^^(n)(l -si)g„(ds) ^ (1 - si)zy(ds) . (3) 

Theorem 1. Assume q = (gn(A), A G Vn),n > 1 satisfies assumption (H). Let Tn have distribution 
Pn, and view Tn as a random element of by endowing it with the graph distance and the uniform 
probability measure f^dTn on dTn- Then we have the convergence in distribution 

n^t[n) n-s>oo 

for the rooted Gromov-Hausdorff-Prokhorov topology. 

There is a similar statement for the trees with laws Q^. Consider a family (gn(A), A G Vn), n > 1 
with gi((l)) = 1. 

Theorem 2. Assume q = (gn(A), A G Vn),^ > 1 satisfies assumption (H), with 

• either 7 G (0, 1), or 

• 7 = 1 and l{n) — )• as n —)• 00. 

Let Tn have distribution Qn. We view Tn as a random element of by endowing it with the 
graph distance and the uniform probability measure fiT^ on Tn. Then we have the convergence in 
distribution 

n't[n) n-s>oo 

for the rooted Gromov-Hausdorff-Prokhorov topology. 

Theorem [2] deals with a more restricted set of values of values of 7 than Theorem [Tj This comes 
from the fact that, contrary to the set T^ which contains trees with arbitrary height, the set T„ of 
trees with n vertices has elements with height at most n — 1. Therefore, we cannot hope to find 
non-trivial limits in Theorem [2] when 7 > 1, or when 7 = 1 and £{n) has limit -|-oo as n — )• 00. The 
intermediate case where £{n) admits finite non-zero limiting points cannot give such a convergence 
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with a continuum fragmentation tree in the hmit either. The reason being that the support of the 
height of a continuum fragmentation tree is unbounded, whereas the heights of Tn/ni{n) are all 
bounded from above by 1/ inf„(^(n)), which is finite under our assumption. 

Note that Theorem [T] (resp. Theorem [2]) implies that any fragmentation tree Ty^u is the con- 
tinuous limit of a rescaled family of discrete Markov branching trees with a prescribed number of 
leaves (resp. with a prescribed number of vertices, provided 7 < 1), since we have the following 
approximation result. 

Proposition 4. For every fragmentation pair (—7, i') with 7 > 0, there exists a family of distribu- 
tions {qn, n > 1) satisfying |Ip and such that ^ holds, with £{x) = 1 for every a; > 0. 

After some preliminaries gathered in Section |3j we prove Theorems [T] and [2] and Proposition |4] in 
Section [4j Before embarking in the proofs, we present in Section [2] some important applications of 
these theorems to Galton- Watson trees, unordered random trees and particular families of Markov 
branching trees studied in earlier works. Of these applications, the first two actually involve a 
substantial amount of work, so that the details are postponed to Section [5] and [6j 



2 Applications 

2.1 Galton- Watson trees 

A natural application is the study of Galton- Watson trees conditioned on their total number of 
vertices. Let ^ be a probability measure on Z_|_ such that .^(0) > and 

Y^kak) = l. (4) 

fc>0 

The law of the Galton- Watson tree with offspring distribution ^ is the probability measure on the 
set of plane trees defined by 

Gwg(w)=n^(^«(*))' 

for t a plane tree. That this does define a probability distribution on the set of plane trees comes 
from the fact that a Galton- Watson process with offspring distribution ^ becomes a.s. extinct in 
finite time, due to the criticality condition Q. In order to fit in the framework of this paper, we 
view GWg as a distribution on the set of discrete, rooted trees, by taking its push-forward under 
the natural projection from plane trees to trees. 

In order to avoid technicalities, we also assume that the support of ^ generates the additive 
group Z. This implies that GWg({#t = n}) > for every n large enough. For such n, we let 

GW^""^ = GW^(- 1 {#t = n}), and view it as a law on T„. 
We distinguish two different regimes. 

Case 1. The offspring distribution has finite variance 

= ^k{k-l)^{k) < 00. 

A;>0 



Case 2. For some a G (1, 2) and c E (0, 00), it holds that ^{k) ^ ck " ^ as A; — )• 00. In particular, 
^ is in the domain of attraction of a stable law of index a. 

The Brownian dislocation measure is the unique binary dislocation measure such that 



1^2{S1 edx) = J ^^3^^_^^3 dx l{l/2<x<l} • 
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Otherwise said, for every measurable f : ^ M+, 



[ z^2(ds)/(s) = /" W — ^-^ -3 drc/(x,l -x,0,0, ...) 



We also define a one-parameter family of measures in the following way. For a G (1,2), let ^j>]^ 
be a Poisson random measure on (0, 00) with intensity measure 

1 dx 

ar(l-i)a;i+V«^^">o>' 

with the atoms Ai,i > 1 labeled in such a way that Ai > A2 > . . .. Let T = Yli>i which is 
finite a.s. by standard properties of Poisson measures. In fact, T follows a stable distribution with 
index 1/a, with Laplace transform 

E[exp(-AT)] = exp(-Ai/") , A > . 

It can be seen as a stable subordinator evaluated at time 1, its jumps up to this time being the atoms 
Aiji ^ !)• The measure Ua is defined by its action against a measurable function f : ^ M+: 

2-rrn 1 /^.\ r 



f / , X X " r(2 - l/a) 

/ z.„(ds)/(s = — pT^^ 

Jsi r(2 - a) 



T 



Because E[T] = 00, this formula defines an infinite c-finite measure on S'^, which turns out to 
satisfy Q. 

Theorem 3. Let ^ satisfy with support that generates the additive group Z. Let Tn be a random 
element of T„ with distribution GW^"^ . Consider Tn as an element of by endowing it with the 
graph distance and the uniform probability measure fJ,T„ on Tn- Then we have, in distribution for 
the Gromov-Hausdorff-Prokhorov topology, 

Case 1 : -i=J-n — > -Ii/2m-2 , 

1 ^ [d) ( a{a - 1) ^ 



n 



This result will be proved in Section jsj below, by first showing that GW^"^ is of the form for 
some appropriate choice of q. 

The trees Ti/2,U2 'Ti-i/a,ua appearing in the limit are important models of continuum random 
trees, called respectively the Brownian Continuum Random Tree and the stable tree with index a. 
The Brownian tree is somehow the archetype in the theory of scaling limits of trees. The above 
theorem is very similar to a result due to Duquesne [TYj , but our method of proof is totally different. 
While [T7j relies on quite refined aspects of Galton- Watson trees and their encodings by stochastic 
processes, our approach requires only to have some kind of global structure, namely the Markov 
branching property, and to know how mass is spread in one generation. We do not claim that 
our method is more powerful than the one used in [17J (as a matter of fact, the limit theorem of 
|17j holds in the more general case where /u is in the domain of attraction of a totally asymmetric 
stable law with index a £ (1,2]). However, our method has some robustness, allowing to shift from 
Galton- Watson trees to other models of trees. Our next example will try to illustrate this. 
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2.2 Uniform unordered trees 



Our next application is on a different model of random trees, which is by nature not a model 
of plane or labelled trees, contrary to the previous examples. It is actually not either a Markov 
branching model, but is very close from being one, as we will see. 

For 2 < m < cxD, we consider the set T^'"'' C T„ of trees with n vertices, in which every vertex has 
at most m children. In particular, we have Tn°°^ = T„. The sets T^™''' are harder to enumerate than 
sets of ordered or labeled trees, like plane trees or Cayley trees, and there is no closed expression 
for the numbers i^T^\ However, Otter [3l] (see also [231 Section VII. 5]) derived the asymptotic 
enumeration result 

W>n ~ ^ , [^>) 

n— >oo n ' 

for some m-dependent constants Km > 0, Pm > ^- This can be achieved by studying the generating 
function 

V'("^)(x) = ^#tM^-, 

n>l 

which has a square-root singularity at the point l/pm- The behavior ([s]) indicates that a uniformly 
chosen element of Tn™"* should converge as n — )• oo, once renormalized suitably, to the Brownian 
continuum random tree. We show that this is indeed the case for any value of m. To state our 
result, let ^ 

T(r)={tGT(r):c0(t)<m-2}. 

For instance, Tn^ = for n > 2, while Tn°°'* = for all n. Let 



V;(-)(x) = ^#T 



(m) ri-n ^ 



n>l 



V2 



and define a finite constant Cm by 



m J 

Note that ^(■^^(x) = x for every x, while ip^°°\l/poo) = 1 123, Section VII. 5]. Therefore, we get 

V2p2 V2 



C2 



Theorem 4. Fix m G {2, 3, . . .} U {oo}. Let Tn be uniformly distributed in We view T„ as 

an element of by endowing it with the measure pT„! then 

1 (d) 

The proof of this result is given in Section [6j We note that this implies a similar, maybe more 
natural, statement for m-ary trees. We say that t G T is m-ary if every vertex has either m children 
or no child, and we say that the vertex is internal in the first case, i.e. when it is not a leaf. Summing 
over the degrees of vertices in an m-ary tree with n internal vertices, we obtain that #t = run + 1, 
and #(9t = {m — l)n + 1. 

Assume now that m < oo. Starting from an m-ary tree t with n internal vertices, and removing 
the leaves — equivalently, keeping only the internal vertices — gives an element (/)(t) G Ti™'^ The 
mapping (p is inverted by attaching m — k leaves to each vertex with k children, for an element of 
Tn"^\ Moreover, we leave as an easy exercise that dQ,Y{p{at,a(j){t)) < a for every a > 0, when the 
trees are endowed with the uniform measures pt, P(j,(t) on vertices. Theorem |4] thus implies: 
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Corollary 1. Let m G {2,3,...,} and Tn be a uniform m-ary tree with n internal vertices, 
endowed with the measure jU [m] . Then 



The problem of scaling limits of random rooted unlabeled trees has attracted some attention in 
the recent literature, see |13 | 129 ^ [T6]. For m = 2, Corollary [T] readily yields the main theorem of 
|29j . which was derived using a completely different method. Indeed, it is based in a stronger way 

(2) 

on combinatorial aspects of Tn . Here, we really make use of a fragmentation property satisfied by 
the laws P-L™^ • As alluded to at the begining of this section, these are not actually laws of Markov 
branching trees. Nevertheless, they can be coupled with laws of Markov branching trees in a way 
that the coupled trees are close in the dcHP metric. In the general case m 7^ 2, Theorem |4] and 
Corollary [1] are new, and were implicitly conjectured by Aldous j2]. In jl6j . the authors prove a 
result on the scaling limit of the so-called profile of the uniform tree for m = 00, which is related to 
our results, although it is not a direct consequence. Finally, we note that the problem of the scaling 
limit of unrooted unordered trees is still open, although we expect the Brownian tree to arise again 
as the limiting object. 

2.3 Consistent Markov branching models 

Considering again in a more specific way the Markov branching models, we stress that Theorem 
[1] also encompasses the results of |26], which hold for particular families {qnj'n > 1) satisfying a 
further consistency property. In this setting, it is assumed that qn{{n)) = for every n > 1, so 
that the trees do not have any vertex having only one child. The consistency property can be 
formulated as follows: 

Consistency property. Starting from T„ with n > 2, select one of the leaves uniformly 
at random, and remove this leaf as well as the edge that is attached to it. If this removal 
creates a vertex with only one child, then remove this vertex and merge the two edges 
incident to this vertex into one. Then the random tree thus constructed has same 
distribution as T„_i. 

A complete characterization of families {qnyn > 1) giving rise to Markov branching trees with 
this consistency property is given in [26] . Namely, such families can be constructed in terms of 
a pair (c, v), which is uniquely defined up to multiplication by a common positive constant, such 
that c > is an "erosion coefficient" and is a dislocation measure as described above (except 
that I'i'^Si < 1) = is not required). The cases where c = and I'iY^Si < 1) = are the most 
interesting ones, so we will assume henceforth that this is the case. The associated distributions 
Qn,n > 2 are given by the following explicit formula: for A G Vn having p >2 parts. 



n 





distinct 



where 



is a combinatorial factor, the same that 
normalizing constant defined by 





i>l 
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Assume further that v satisfies the following regularity condition: 



v{sx <\-e) =e-^£(l/e), 
where 7 G (0, 1) and £ is a function that is slowly varying at 00. Then 



(7) 



Theorem 5. If u is a dislocation measure satisfying and and if{qn,n > 1) is the consistent 
family of probability measures defined by then the Markov branching trees Tn, viewed as random 
measured M-trees by endowing the set of their leaves with the uniform probability measures, satisfies 



(d) 



r(l-7)n^£(n) n- 

for the Gromov-Hausdorff-Prokhorov topology. 

This theorem is in some sense more powerful than [26^ Theorem 2], because the latter result 
needed one extra technical hypothesis that is discarded here. Moreover, our result holds for the 
Gromov-Hausdorff-Prokhorov topology, which is stronger than the Gromov-Hausdorff topology 
considered in [26]. However, the setting of |26j also provided a natural coupling of the trees 
Tn, n>\ and 7^^,j/ on the same probability space, for which the convergence in Theorem [5] can be 
strengthened to a convergence in probability. This coupling is not provided in our case. 

Proof. Let s S 5^ be such that X^j>^ = 1- Let Ki, . . . , Kn be i.i.d. random variables in N 



such that F{Ki 



Si for every i > 1. Call A^'^\n) the number of variables Kj equal to 



and let A^^\n) = (^A^^\n) , A^'^\n) , ■ ■ , where x.-^ denotes the decreasing rearrangement of the 
non-negative sequence x = (xi,X2, • • .) with finite sum. It is not hard to see that the probability 
distributions Qn defined by ^ are also given, for A 7^ (n), by 

QnW = ^ [ P(A(^)(n) = A)z.(ds). 



See for example the forthcoming Lemma[5]in Section 3.2.4[ The normalizing constant Zn is regularly 
varying, according to the assumption of regular variation ([T]). Indeed, by Karamata's Tauberian 
Theorem (see |1H Th. 1.7.1']), we have that 



(l-s^)i/(ds) ~ r(l-7)zy(si < 1-1/n) = r(l-7)nT£(n). 



Now, to get a convergence of the form dsL note that for all continuous functions / : S'^ 



zy(ds)E 




/ Kds)(l-5i)/(s) 



which follows by dominated convergence, since / is bounded (say by K) on the compact space 
and 



E 



^ A^-^(n) \ / A(-)(n) 
n ) [ n ^ 

We conclude by applying Theorem [T] 



< KE 



n 



< KE 



A(i)(n) 



n 



K(l-s, 



□ 
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2.4 Further non-consistent cases: {a, 6)-trees 

Another application concerns a family of binary labeled trees introduced by Pitman and Winkel 
[53j and built inductively according to a growth rule depending on two parameters a E (0, 1) and 
^ > 0. Roughly, at each step, given that the tree j'^'^-'^^^ with n leaves branches at the branch point 
adjacent to the root into two subtrees with k > 1 leaves for the subtree containing the smallest 
label in j""'^'^^^ and n — k > 1 leaves for the other one, a weight a is assigned to the root edge and 
weights k — a and n — k — 1 + 9 are assigned respectively to the trees with sizes k, n — k. Then 
choose either the root edge or one of the two subtrees according with probabilities proportional 
to these weights. If a subtree with two or more leaves is selected, apply this weighting procedure 
inductively to this subtree until the root edge or a subtree with a single leaf is selected. If a subtree 
with single leaf is selected, insert a new edge and leaf at the unique edge of this subtree. Similarly, 
if the root edge is selected, add a new edge and leaf to this root edge. We denote by the tree 
rpa,e,ia,h ^j^jjQ^i^ labels. 

Pitman and Winkel show that the family {Tn'^,n > 1) is not consistent in general ( \33\ Propo- 
sition 1]), except when 6 = 1 — a or = 2 — a, and has the Markov branching property ( |33| 
Proposition 11]) with the following probabilities qn- 

• Qn {{k,n — k,0, ...))= qa0{n — 1, k) + qa.e{n — l,n — k), iorn — k<k<n— 1 

• qn in/2, n/2) = qa,e{n - 1, n/2), 
where 

, (n\a{n-k)+ekT(k-a)Vin-k + e) 

9a,e(n,A; = — , l<k<n. 

\k J n r(l — a)r(n + y) 

Now consider the binary measure v^fi defined on by Va,e{si + S2 < 1) = and Va,e{si £ dx) = 
fa,e[x)dx where fafi is defined on [1/2, 1) by 

■a-i^e-i 



faA^) = V (a (1 - ^) + Ox) x-"-i(l - + {ax + 6(1 - x)) (1 - x)" 

1 (1 — aj V 



Theorem 6. Endow, as usual, Tn'^ with the uniform probability measure on dTn'^ ■ Then, 

n n-¥oo 

for the rooted Gromov-Hausdorff-Prokhorov topology. 

This result reinforces Proposition 2 of [33] which states the a.s. convergence of in a certain 
finite dimensional sense, to a continuum fragmentation tree with parameters a,Va,e. In view of 
Theorem [Tl it suffices to check that hypothesis (H) holds, which in the present case states that for 
any / : 5^— )• M continuous and bounded with |/(s)| < (1 — si), 

n-i f k n — k \ 

V fi-,- ,0,...]qn{{k,n-k,0,...))^ f{x,l-x,0,...)fc,,e{x)dx. 

\n n / /i /9 

k=\nl'l\ ^ ^ ''^1^ 

To prove this, we use that x°-~'^{\ — x)''^^dx = r(a)r(6)/r(a + fe), and rewrite qa,e{n — 1, k) as 

\ k J n — 1 r(l — a)r(n — 1 + t^j Jo 

Then set for x £ [0, 1], 

F{x) := f{x, 1 - X, 0, . . .)l{a;>l/2} + /(I - X, X, 0, . . ■)l{x<l/2} 
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and note that F{0) = and < (1 — x) A x, for every x £ [0, 1]. We have, 



n-l 



E / 

k=\n/2] 



k n — k 



n n 



\ / k\ 

.0,...] qn{{k,n - k,0, . . .)) = Y,F [-) QaA^ - 



rfn- 1 



a 



r(l-a)r(n-l + i 

T{n-l + 9-a) 
r(l-a)r(n-l + ( 



I n— 1 

E 

fc=0 

1 

E 



k=0 

n-l\a{n-l-k) + ek ^ f k 
k I n — 1 



n 



a 1 



{x) ■ 



B. 



+ 



(x) ■ 
n— 1 



n — 1 n — 1 



F 



B 



(x) ■ 
n-1 



where B^^}^ denotes a binomial random variable with parameters (n — l,x). We can assume that 

B^]_i/n — )• X a.s. on the probability space {Q,F,F), and since F is continuous and bounded on 
[0, 1], we have 



E 



a 1 



B 



(x) ■ 

71-1 



B 



+ 



{x) ■ 
n-1 



n — 1 I n — 1 



F 



B. 



(x) ■ 
n-1 

n 



{a (1 — x) + Ox) F{x) , for every x G [0, 1]. 



Moreover, 



E 



a 1 



B, 



(x) ■ 
n-1 



B 



+ 



(x) ■ 
n-1 



n — 1 I n — 1 



F 



B. 



n-1 



n 



n-1 



n 



A qE 



1 



< |^(a + ^)E 

< ((a + 6l)x) A(a(l-x) + (9x). 
This is enough to conclude by dominated convergence that 



(x) 
n-1 



n — 1 



+ eE 



-"n-1 

n — 1 



E 



I ^n-l \ p ^n-1 

n — 1 I n — 1 I \ n 



(x) 



a 1 



x-°-^(l-x)'-Mx 



/ (a(l -x) + ^x)F(x)x~"-^(l -x)^-Mx 
Jo 

r(l - a) /(x, 1 - X, . . .)/Q,e(2;)dx. 

Jl/2 



Last, Stirling's formula implies that 



T{n-l + 9-a) 
T{n-l + l 



~ n 

n— >oo 



as wanted. 



3 Preliminaries on self-similar fragmentations and trees 
3.1 Partition- valued self-similar fragmentations 

In this section, we recall the aspects of the theory of self-similar fragmentations that will be needed 
to prove Theorems [T] and [2I We refer the reader to [9] for more details. 
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3.1.1 Partitions of sets of integers 

Let C N be a possibly infinite, nonempty subset of tlie integers, and vr = {vri, 7r2, . . .} be a 
partition of B. The (nonempty) sets vri, 7r2, . . . are called the blocks of tt, we denote their number 
by 6(7r). In order to lift the ambiguity in the labeling of the blocks, we will use, unless otherwise 
specified, the convention that TTi,i > 1 is defined inductively as follows: vr, is the block of vr that 
contains the least integer of the set 



-1 



i=i 

if the latter is not empty. For i £ B, we also let Tr^j) be the block of vr that contains i. 

We let Vb be the set of partitions of B. This forms a partially ordered set, where we let n ^ tt' 
if the blocks of vr' are all included in blocks of vr (we also say that tt' is finer than vr). The minimal 
element is = {B}, and the maximal element is = {{i} : i £ B}. 

If B' C B is nonempty, the restriction of vr to -B', denoted by vrj^/ or S'nvr with a slight abuse of 
notations, is the element of Vb' whose blocks are the non-empty elements of {B' n vri, i?' H vr2, . • •}• 

If C N is finite, with say n elements, then any partition vr G Vb with b blocks induces 
an element A(vr) G Vn with b parts, given by the non-increasing rearrangement of the sequence 
(#vri,...,#vrfe). 

A subset S C N is said to admit an asymptotic frequency if the limit 

#{Bn[n]) 



lim 



n 



exists. It is then denoted by \B\. It is a well-known fact, due to Kingman, that if vr is a random par- 
tition of N with distribution invariant under the action of permutations (simply called exchangeable 
partition), then a.s. every block of vr admits an asymptotic frequency. The law of vr is then given 
by the paintbox construction of next section, for some probability measure u. We let |vr|-^ G 
be the non-increasing rearrangement of the sequence (|vrj|,z > 1). The exchangeable partition vr is 
called proper if Yl'i=i Wi\ — 1) which is equivalent to the fact that vr has a.s. no singleton blocks. 



3.1.2 Paintbox construction 

Let be a dislocation measure, as defined around Q. We construct a a-finite measure on Vf^ by 
the so-called paintbox construction. Namely, for every s G 5^'' with X^j>i Sj = 1, consider an i.i.d. 
sequence {Ki,i > 1) such that 

W>(Ki = k) = Sk, k>l. 

Then the partition vr such that i,j are in the same block of vr if and only if Ki = Kj is exchangeable. 
We denote by Ps(dvr) its law. Note that ps(dvr)-a.s., it holds that |vr|^ = s, and vr is a.s. proper 
under ps if and only if s sums to 1. The measure 

Kuidir) := / z^(ds)/)s(dvr) 

is a cr- finite measure on V^m, invariant under the action of permutations. From the integrability 
condition ([2]) on z^, it is easy to check that for /c > 2, if 

Ak = {tt e Vn : 7r\ik] ^ {[k]}} 
is the set of partitions whose trace on [k] has at least two blocks, then 

Ku{Ak) = [ Hds) (l-Y. (8) 
for every > 2, since 1 — X]i>i s\ <\ — s\ < k{\ — s\). 
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3.1.3 Exchangeable partitions of finite and infinite sets 

In this section, we establish some elementary results concerning exchangeable partitions of [n] or 
N. The set of partitions with variable size, namely 

n>l 

is endowed with the distance 

dp{TT,TT') = exp(-sup{A; > 1 : 7r|[fc] = vr'|[fc]}) . 

In the sequel, convergence in distribution for partitions will be understood with respect to the 
separable and complete space {V,dp). We will use the falling factorial notation 

T(x + 1) 

[x)n = x{x — 1) . . . (x — n + 1) — 



r(x-n + l) 



for X a real number and n £ N, n < x + 1. When x G N, we extend the notation to all n £ N, by 
setting {x)n = for n > x + 1. 

Lemma 1. Let vr be an exchangeable partition of [n], and let k < n. Then for every B C [k] with 
I elements such that 1 £ B, 

P {[k] n 7r(i) = B I #7r(i)) = 



(n-i; 



fc-1 



Proof. By exchangeability, the probability under consideration does depends on B only through 
its cardinality, and this equal to P (^2, ■ ■ ■ ,ii G ■ ■ ■ ,jk-i ^ '^(i)l#'^(i)) for pairwise dis- 

joint 12, ■ ■ . ,ii, ji, ■ . . , jk-i £ {2,3, ... ,n} (note that there are ("Ji ) (^Z|) such choices). Conse- 
quently, 



E 

P([fe]n7r(i) =S|#7r(i)) = - 



Yl i2,---,il l{«2v,«i67r(i)}l{ji,...jfc_;^7r(i)} I #^(1) 



l-l !\ k-l 



u-i; \k-U 

where the sum in the expectation is over indices considered above. This yields the result. □ 

Lemma 2. Let (vr'-^^n > 1) be a sequence of random exchangeable partitions respectively in 'P\n\. 
We assume that tt^"^ converges in distribution to vr. Then vr is exchangeable and 

^ ! 

the latter convergences holding jointly for i > 1, and jointly with the convergence tt^^^ — )■ vr. 

Proof. The fact that vr is invariant under the action of permutations of N with finite support is 
inherited from the exchangeability of Tr^"'^ and one concludes that vr is exchangeable [3]. 

The random variables O^ir^^^ /n,i > 1) take values in [0, 1], so their joint distribution is tight, 
and up to extraction, we may assume that they converge in distribution to a random vector i > 
1), jointly with the convergence vr^") — )• vr. We want to show that a.s. = |vr(j)|, which will 
characterize the limiting distribution. 
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For A; > / > 1 fixed, by summing tlie formula of Lemma [T] over all B C [k] containing i, with I 
elements, we get 

k-l\ (#vrg)-l)^-i(n-#vr;"))fc_^ 



)fc-i 



which entails that, conditionally on x^^-j, #([A;] n 7r(j)) — 1 follows a binomial distribution with 
parameters {k — Therefore, 

|vr(,)|=^lnn = a.s., 

by the law of large numbers. □ 

Lemma 3. Let (vr^^^ 1 < i < r) be a sequence of random elements ofVn, which is exchangeable in 
the sense that (crvr'^*^, 1 < i < r) has same distribution as (vr^*), 1 < i < r), for every permutation a 
o/N. Then for every k >2, 

P(2, 3,...,ke vrfJJ I Ivrg 1, 1 <i<r,j>l)= Ivrfjjl'^-^ . 
Proof. Let n > k and set 

^ii,n) ^ 7r«|[„], SO that (7r(*'"),l < i < r) is a random sequence of "Pr^i 
that is exchangeable. Then, by using the same argument as in the proof of Lemma [T| it holds that 

/ u {l,n) _ 1 \ 

P(2, 3, . . . , G 7r(\) I i^TT^.^ \l<i<r,l<j <n) = _ . 

Using Lemma and the fact that (Tr^*'"^ converges in distribution to (vr^*^ 1 < i < r) 

as n — )• 00, it is then elementary to get the result by taking limits. □ 



3.1.4 Poisson construction of homogeneous fragmentations 

We now recall a useful construction of homogeneous fragmentations using Poisson point processes. 
We again fix a dislocation measure v. 

Consider a Poisson random measure A/'(dM7rdi) on the set M+ x Vn x N, with intensity measure 
dt ® «;,y(d7r) ® #pj(di), where #pj is the counting measure on N. We use a Poisson process notation 
(ttj , )i>o for the atoms of M: for t > 0, if (t, vr, i) is an atom of M then we let (ttj , ) = (tt, i), and 
if there is no atom of M of the form (t, vr, z), then we set vrf = and = by convention. One 
constructs a process (n°(t), t > 0) by letting n'^(O) = On, and given that n''(s), < s < f has been 
defined, we let n'^(t) be the element of obtained from Ii^{t—) by leaving its blocks unchanged, 
except the i?-th block Y^ait—), which is intersected with vrP. Of course, this construction is only 
informal, since the times t of occurrence of an atom of A/ are everywhere dense in M-|-. However, 
using ([s]), it is possible to perform a similar construction for partitions restricted to [A;], and check 
that these constructions are consistent as k varies [9, Section 3.1.1]. The process {Ji^{t),t > 0) is 
called a partition-valued homogeneous fragmentation with dislocation measure vr. 

Note in particular that the block 11^^^^ (t) that contains 1 at time t, is given by 

n?i)(t)= n (9) 

0<s<i 
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and that the restriction of N to M-|- x Vn x {1} is a Poisson measure with intensity dt ® Kp{dT:). 

For k >2, let Dl = inf{t > : n°(t) G Ak] be the first time when the restriction of n°(t) 
to {k\ has at least two blocks. By the previous construction, it is immediate to see that Z)^ 
has an exponential distribution with parameter Ki,{Ak): it is the first time t such that = 1 
and S Ak- Moreover, by standard properties of Poisson random measures, conditionally on 

= s, the random variables vr^ and (ttj , )o<t<s are independent, and the law of vr^ equals 
i^u{-\Ak) = n Ak) / K.v{Ak), while (vr^, i°)o<t<s has same distribution as the initial process 
conditioned on {(7r°,i°) ^ A^ x {1},0 <t<s} = {D^ > s}, which has probability e"'"*''^^''-). It is 
also equivalent to condition on {D^ > s}, since "^{D^ = s) = 0. The next statement sums up this 
discussion. By definition, we let X{t A s—) = X{t)l^)-^gj + X{s—)l^fygj for X cadlag. 

Lemma 4. Let F, f be non-negative measurable functions. Then 



E 



F{U'{tADl-),t>0)fi7r'^o) 

k . 
/■OO 

= HuiflAk) K^(ylfc)dsE[F(n°(tAs),t >0)1, 
Jo 



{Dl>s}\ ■ 

Otherwise said, vr^o and (n°(t A D^ — ),t > 0), are independent with respective laws \ A^), and 

the law of (n'^(t A <E),t > 0) where e is an exponential random variable, independent of , and 
with parameter Kiy{Ak). 



3.1.5 Self-similar fragmentations 

From a homogeneous fragmentation constructed as above, one can associate a one-parameter 
family of "Ppj-valued processes by a time-changing method. Let a G M. For every i > 1 we let 
{T^-'^{t),t > 0) be defined as the right-continuous inverse of the non-decreasing process 

Jo 

For t > 0, let n(t) be the random partition of N whose blocks are given by n|'.j(T^'J^(t)), i > 1. 
One can check that this definition is consistent, namely, that for every j G n^.^(r^'^^(t)), one has 

n?.)(-«(*)) = n?.)(-o)W). 

The process (n(t),t > 0) is called the self-similar fragmentation with index a and dislocation 
measure v [9j, Chapter 3.3]. We now assume that a = —7 < is fixed once and for all. Let 
Dk = inf{i > : U{t) e Ak}. 

Proposition 5. Conditionally given a{Il^^'j(t A Dk) ■ t > 0,1 < i < k}, and letting vr = Il{Dk), 
the random variable {Ili{t + Dk),t > 0)i<j<;,([fc]pi^) has same distribution as (vTj n n(*)(|7rj|"t), t > 
0)i<j<6([fc]n7r)) where (Il^^^i > 1) are i.i.d. copies ofU. 

Proof. For every z > 1 we let Lj = mi{t > : U(^i){t) D [k] / [k]}. Then L = {Li,i > 1) is 
a so-called stopping line, i.e. for every i > 1, Li is a stopping time with respect to the natural 
filtration of H^j), while Li = Lj for every j G n(j)(Lj). We let n(L) be the partition whose blocks 
are n(j)(Lj),i > 1 — by definition of a stopping line, two such blocks are either equal or disjoint. 
Note that t + L = (t + Li, i > 1) is also a stopping line, as well as t A L = (t A Li, i > 1). 

From the so called extended branching property [9l Lemma 3.14], we obtain that conditionally 
given o"{n(t f\ L),t > 0}, the process (Jl{t + L),t>0) has same distribution as 

(Knn«(|7r,rt)),i > >o), 

where tt = n(L) and (nW,z > 1) are i.i.d. copies of H. The result is then a specialization of this, 
when looking only at the blocks of H that contain 1,2, ... ,k. □ 
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It will be of key importance to characterize the joint distribution of D^, (n(j)(Z)fe), 1 < i < k). 
This can be obtained as a consequence of Lemma |4j Recall the construction of 11 from II'', let 
r(j) = T^^y and define vrj = vr^^^^^^^. The latter is equal to "^^^ ^(f) every i G [k] and t < Dk- 

Proposition 6. Let F, f be non-negative, measurable functions. Then 



E 



F(|n(i)(tAZ)fc-)|,t>0)/(7rz3, ^ 



Proof. By definition, Dk (resp. Z)^) is the first time when [k]nU{t) / [A;] (resp. [k]nU^{t) ^ 
It follows that = T(i-^{Dk), and that the process 

n(i)(t A Dk-) = nOi)(T(i)(t A Dk-)) = nfi)(r(i)(t) A Dl-) , t>0 

is measurable with respect to a{Ii^^~^{t A D^—),t > 0}. Lemma |4j implies that 



E 



F(|n(i)(tAi?fe-)|,t>0)/(7rB, 
= E[F(|nOi)(T(i)(t) A Dl-)\,t> 0)/(vr^o) 

POD 

= ^^uif\Ak)J^ dsK,{Ak)E[F{\Ul^{^,)it)As)\,t>0)l^oo^,y 

= K.(/|Afc)E / d^x/t,(^fe)|n(i)(n)rF(|nO (T(i)(t) Ar(i)(n))|,t >0)l|z),>n} 
.Jo 

' POO 

= K,if\Ak)E / d^x/t,(^fc)|n(i)(n)rF(|n(i)(tAn)|,t>0)l|z3,>„} 



where in the third equality, we used successively Fubini's theorem and the change of variables 
s = r(i)(ti), so that ds = |n(i)(n)|"dM. We conclude by using the fact that 



»(z)fe >n||n(i)(t)|,0<f <u) = !%)( 



(10) 



which can be argued as follows. Let < ti < ^2 < • • • < = "U be fixed times, then by applying 
Lemma [3] to the sequence (n(tj), 1 < i < r), we obtain that 



Dk > n| |n(i)(ti)|,l < i<r]= |n(i)( 



ifc-i 



This yields (10) by a monotone class argument, using the fact that cr{ | IIj-x) (t) |, < i < u} is 
generated by finite cylinder events. □ 

The last important property of self-similar fragmentation is that the process (|n(i)(t)|,t > 0) 
is a Markov process, which can be described as follows [5]. Let > 0) be a subordinator with 
Laplace transform 



E 



[exp(-r6)] = exp - t (l - Y^s^^')"^^^)) ■ 



Then (|n^^-j(t)|, t > 0) as same distribution as (exp(— > 0), and consequently, the process 
(|n(x)(t)|,t > 0) is a so-called self-similar Markov process: 

Proposition 7 (Corollary 3.1 of [S]). The process {\Il(^i'j{t)\,t > 0) has same distribution as 
exp(— > 0), where r is the right- continuous inverse of the process [f^ ex.p{a^s)ds,u > 0). 
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3.2 Continuum fragmentation trees 

This section is devoted to a more detailed description of the limiting self-similar fragmentation 
tree T^^y [^. In particular, we will need a new decomposition result of reduced trees at the first 



branchpoint (Proposition 10) 



3.2.1 Trees with edge-lengths and R-trees 



We saw in Section 1.3.1 how to turn a tree into a (finite) measured metric space. It is also easy to 
"turn discrete trees into M-trees" , viewing the edges as real segments of length 1 . 

More generally, a plane tree with edge-length is a pair 9 = (t, {lu, u £ t)) where iu > for every 
u G t, and a tree with edge-lengths is obtained by "forgetting the ordering" in a way that is adapted 
from the discussion of Section |1.1| in a straightforward way. Namely, the plane trees with edge- 
lengths (t, {iu, u G t)) and (t', (£'„, u G t')) are equivalent if there exist permutations a = ((T„, u G t) 
such that at = t' and I', \ = ^u, for every u £ t. We let be the set of trees with edge- 
lengths, i.e. of equivalence classes of plane trees with edge-lengths. There is a natural concatenation 
transformation, similar to (•), for elements of 0. Namely, if 0^*^ = (t^*\ {iu\u G t)), 1 < i < A; is a 
sequence of plane trees with edge- lengths and £ > 0, let 

{e^'\i<i<k)e = {t,{eu,uet)), 

be defined by 

t = (t(^),l <i<k), 

and 

If we replace each 0^"^^ by another equivalent plane tree with edge-lengths, then the resulting con- 
catenation is equivalent to the first one, so that this operation is well-defined for elements of 

Let 6 £ @, and consider a plane representative (t, {iu,u £ t))- We construct an M-tree T by 
imagining that the edge from pr(it) to u has length iu- Note that this intuitively involves a new 
edge with length £0 pointing from the root p of the resulting M-tree to (this is sometimes called 
planting). Formally, T is the isometry-equivalence class of a subset of M* endowed with the /^-norm 
||(xm,u G t)||i = X^^g^ \xu\, defined as the union of segments 



where {eu,u G t) is the canonical basis of M* and v ^ u means that w is a strict ancestor of u in 
t. This M-tree is naturally rooted at G M*. Of course, its isometry class does not depend on the 
choice of the plane representative of 6, and can be written 7~(^) unambiguously. Note that there is 
a natural "embedding" mapping tit—)- T{9) inherited from 

LQ-.t^T, io{u) = e^ey , (11) 

and the latter is an isometry if 9 is endowed with the (semi-)metric dg on its vertices, defined by 

dg{u,v)= ^ iu,, 

where xor denotes "exclusive or" . 

Conversely, it is an elementary exercise to see that any rooted M-tree T with a finite number of 
leaves can be written in the form T = T{9) for some G 0, which is in fact unique. In the sequel. 
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we will often identify the tree 9 ^ & with the M-tree T^O). For instance, this justifies the notation 
{T^^\ T^''^)e for M-trees T^^\ . . . , T^''^ with finitely many leaves and for ^ > 0, which stands for 
the M-tree in which the roots of ■ ■ ■ jT*^*^-* have been identified, and attached to a segment of 
length £ to a new root. 

With a discrete tree t , we canonically associate the tree with edge-lengths 9 in which all lengths 
are equal to 1, and the rooted M-tree T(t) = T{9). In this case, dg = dgj. is the graph distance. 
Using the isometry t : 1 1— T(t), we get the following statement, left as an exercise to the reader. 



anc 



Proposition 8. Viewing t G T as the element (t, dgr, p, ^ugt) of as in Section 1.3.1 
endowing T(t) with the uniform probability distribution on C{T{t)), it holds that 

'^GHp(at, a7'(t)) < a , a>0. 

Due to this statement, in order to prove that the Markov branching tree T„ with law 
converges after rescaling towards T'y^u, it suffices to show the same statement for the M-tree T(Tn). 
We will often make the identification of Tn with T{Tn). 



3.2.2 Partition-valued processes and M-trees 

Let (7r(t), t > 0) be a process with values in C C N, finite or infinite, which is non-decreasing and 
indexed either by i G Z+ or t G M+, in which case we also assume that 7r(-) is right-continuous. We 
assume that there exists some to > such that vr(to) = Ic- Let B C C he finite. If S = {i}, we let 

Dl^y = mf{t >0:{i}G vr(t)} 

be the first time where i is isolated in a singleton block, and for > 2, let 

Dl = inf{t >0:Br\ 7r(t) / B} . 

We can build a tree with edge-lengths (and labeled leaves) 9{7r{-),B) by he following inductive 
procedure: 

1. U B = {i} we let 9{tt{-),B) be the tree • with length DJ.^ 

2. if #B > 2, we let 

9{7r{-),B) = {9{tt{DI + -),Bn MDl)), l<i< h)Di , 

where b is the number of blocks of 7r{D^) that intersect B, and which are denoted by 
7ri(D5),...,7rb(Z)5). 

Note that the previous labeling convention for blocks may not agree with our usual convention of 
labeling with respect to order of least element. 

If {7r{t),t £ Z+) is indexed by non-negative integers, and satisfies 7r(0) = Oc", there is a similar 
construction with trees rather than trees with edge-lengths. Namely, we let t^(.-) be defined by 

1. t^(.) = • if #C = 1, and 

2. t^(.) = (t,r,(i)n7r(-+i)i 1 < ^ < &) otherwise, where b is the number of blocks of vr(l), denoted 
by vri(l),...,7rb(l). 

It is then easy to see that, with the notations of the previous section, 

nt^i.)) = T{9{7r{.),C)), (12) 

And one can view 9{tt{-),B) as the subtree of t^(.) spanned by the root and the leaves with labels 
in B. 
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3.2.3 Continuum fragmentation trees 

Let (n(t),t > 0) be the self-similar fragmentation process with index —7 < and dislocation 



measure v. The formation of dust property alluded to in Section 1.4 amounts to the fact that 
almost-surely, there exists some time to > such that 11 (t) = 1^ for every t > Iq. Consequently, 
the construction of the previous paragraph applies with C = N, and allows to construct a family 
of M-trees 

indexed by finite subsets B C N. Recall that a tree E has been identified with T{0) G 
These M-trees have finitely many leaves that are naturally indexed by elements of B. Moreover, 
they satisfy an obvious consistency property, meaning that taking the subtree spanned by the root 
and the leaves indexed hy B' G B yields an M-tree with same law as TZb'- This is the key to the 
definition of the fragmentation tree 7^,u- 

Proposition 9 ([24J). Conditionally given Ty^u = {T^jd, p, n), letLi,L2,.-- be an i.i.d. sequence 
of leaves ofT distributed according to fi. Then for every finite B CN, the reduced subtree 



n{Ty,,,B) = [J[[p,L,]] 



has same distribution as TZb- 

Moreover, the law of Ty^u is the only one having this property, among distributions on that 
charge only the set of {{T, d, p, p) G : V a; G T, 2; ^ ^[T) =^ p{Tx) > 0}. 

As an easy consequence, we have the following "converse construction" of fragmentations from 
Tj,u- With the notation of the proposition, for every t > 0, let n(t) be the partition of N such that 
i,j are in the same block of n(t) if and only if d{p, Li A Lj) > t. Then (n(t), t > 0) is a self-similar 
fragmentation process with dislocation measure i' and index —7. 

Also, note that the reduced trees TZ{Ty^p, B) rooted at p and endowed with the empirical measure 

^ ieB 

converge in distribution as ^B — t- 00 in towards (T, d, p, p). In fact, the convergence holds a.s. 
if B = [k] with k — t- 00: this is a simple exercise using the fact that {Lj, i > 1} is a.s. dense in C{T) 
(by property 3. in the definition of 7^,jy), and the weak convergence of p^^j to p as k ^ 00. 

The following statement gives a decomposition of the reduced tree Tl{T, [k]) at its first branch- 
point above the root. Recall the notation Dk = inf{t > : n(t) G ^a:}- 

Proposition 10. Let k >2 and vr = Il{Dk),Tr' = 7r\k,b = b{'K'). Then conditionally on {ii^Dk}, 
the reduced tree 7^(7^^i,, [k]) has same distribution as 



T({\7Ti\''n{T^'\7T',),l<i<b) 



where the T^^^ are i.i.d. with same distribution as T^^v, independent of a{Tr, D/^} . 

Moreover, for every i G N, the tree TZ{T'y,u, {i}) has same distribution as the M-tree associated 
with the tree {0,Di) G 0, i.e. a real segment with length Di = inf{t > : {1} G n(t)}. 

Proof. The second statement is just a matter of definitions, so we only need to prove the first 
one. By Proposition [sj the process n(Z)fc + in restriction to the blocks containing at least one 
element in [k], has same distribution as the partitions- valued process whose blocks are those of 
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7rinnW(|7ri|-'>-),l <i<b, for i.i.d. copies Il^^^i > 1 of H, independent on vr,Dfc. Therefore, one 
gets from the definition oiTZs that 

7^[fc/=V(0(^»(Kr^•),v^O, 1 < i < &)d, , 

from which the result follows immediately. □ 

Note that Proposition [6] gives the joint distribution of Dfc,|7rj|,l < i < vr' as a special 
case, while Proposition [7] characterizes the law of Di, since it is the first time where the pro- 
cess (|n(x)(t)|,t > 0) attains 0. This, together with the previous proposition, allows to characterize 
entirely the laws of the reduced trees of hence the law of Ty^u itself. 



3.2.4 Markov branching trees as discrete fragmentation trees 

Recall the informal description of Markov branching trees in the introduction, relying on col- 
lections of balls in urns. Rather than collections of indistinguishable balls that split randomly, it is 
convenient to consider instead a collection of balls that are distinguished by a random, exchange- 
able labeling. This is achieved by replacing partitions of integers by partitions of sets. We start 
with a preliminary lemma. 

Lemma 5. Let n > 1 be fixed, as well as a partition X G Vn with p = p{X) parts. 

(i) There are 

^ ^ ri^ 

' nLiA.!n-=i"^.(A)! 

partitions vr E V^n] such that A(7r) = A. 

(ii) If I < k < n and vr' G "Pj^j has b blocks, then for every ii, . . . , i;, G {1,2, ... ,p} pairwise distinct, 
there are 



1 ^ 

Cl'{iu...,ib) = Cx-^ll{X 



partitions it G V^^] such that X{tt) = X, 7r|[^] = vr' and ^ttj = Xi., 1 < j <b. 

Proof. Let p be the number of parts of A. Then there are p!/ 11^=1 mj{Xy. sequences (ci, . . . , Cp) 
whose non-increasing rearrangement is A. With any such sequence, we can associate 



sequences of the form . . . , Bp) such that {Bi, . . . , Bp} is a partition of [n] (beware that the 
labeling of the blocks Bi will differ in general from labeling convention described above for the blocks 
of a partition), with = Ci,l < i < p. Finally, exactly p\ sequences of the form {Bi, . . . , Bp) 
induce the same partition {Bi, . . . ,Bp}. Putting things together easily yield the formula for Cx. 

For the second formula, if A G 7^n,,7r' G V[k] and ii,. . . ,ih are given with b = b^ir'), then any 
partition vr G V[n] with A(7r) = A and tt\[^ = n' must have 7ri\[k] = {'^\[k])i = tt-, for 1 < z < 6, the 
first equality coming from our choice of the labeling of blocks of partitions. The restriction of tt 
to [A;] is thus entirely determined, while the restriction of vr to [n] \ [k] is a partition of the latter 
set whose block-sizes are given by the sequence Aj, i ^ {ii, . . . , it,}, A,^. — #7r^-, 1 < j < b. A simple 
adaptation of point (i) shows that there are 



ni<(., i.)A,!n;.i(>i, -#';)! 

such partitions. □ 
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Figure 2: An sample tree T[„] for n = 11, with the labeled leaves. The process can be easily 

deduced: for instance, n(ii)(l) = {{1, 2, 6, 7}, {3, 5, 10}, {4, 8, 9, 11}}. As opposed to Figure |ll 
leaves are all connected to vertices with at least 2 children, because of the requirement gi(0) = 1. 



Going back to Markov branching trees, let -B C N have n > 2 elements. Let q = {qn,n > 1) 
satisfy ([l]), and also assume that (7i(0) = 1. For every vr G Vb, set 

gn(A(7r)) 

Pb[V = —^ , (13) 

where Cx is the constant appearing in Lemma [sj Given the partition of n that it induces (which 
has distribution Qn), a ps-distributed partition is thus uniform among possible choices of partitions 
of B. In particular, a random partition with distribution ps is exchangeable, i.e. its law is invariant 
under the action of permutations of B. By convention, the law ps, if B = {i} is a singleton, is the 
Dirac mass at the partition {{i}}. 

For every tt S Vb with blocks vri, 7r2, . . . , tt^ say, consider random partitions tt^,1 < i < k 
of TTi , . . . , TTfc respectively, chosen independently with respective distributions Pt^-^ , ■ ■ ■ , Pizk ■ 

We let 

Q{tt, •) be the law of the partition IJi<i<fc^* ^ ^b- Then Q is the transition kernel of a Markov 
chain on Vb (for any finite i? C N), that ends at the state I^. It is easily seen that this Markov 
chain is exchangeable as a process. Moreover, the chain started from the state {B}, with ^B = n 
has same distribution as the image of the chain started from [n] under the action of any bijection 
[n] ^ B. 

For finite C C N, we let (11'^ (r), r > 0) be the chain with transition matrix Q and started from 
n'^(O) = Oc- P lainly, is non- decreasing and attains Ic in finite time a.s., so the construction 



of Section 3.2.2 applies and yields a family 6{Ii^ {■),B) G 0,i? C C, as well as a tree Tq 
tnc(.). By construction, given that n'-^(l) has blocks 7ri,...,7rb, the trees t^.|-|nc^(-+i)) 1 < ^ < 
are independent with same distribution as T.,^. ,i < i < b respectively. Since the law of the non- 
increasing rearrangement of #7rj, 1 < i < b is q^Ci we readily obtain the following statement]^ 

Lemma 6. The tree Tc has law P^^"- 



^There is one subtlety in this statement, which is in the case C = {i} for some i G N. Indeed, by construction we 
have Tc = • a.s., and this is the only place where we have to require that gi(0) = 1. 
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In fact, the leaves of the tree Tc are naturally labeled by elements of C. We will use it in the 
sequel, without further formalizing the notion of trees with labeled leaves. 

We will also use the shorthand notation for the reduced tree 6(J1'-' {■),B). Using the above 
description, and applying the Markov property for 11'^ at time and the particular form of the 
Markov kernel Q, we immediately obtain the following, in the particular case B = [k]^C = [n]. 

Proposition 11. Let 2 < k < n. Then, conditionally on -Dj^j"' = i and n["](Dj^j"') = vr, with 
7r|[fc] = vr', it holds that T^^ has same distribution as 

(e«,l<i<5(vr'))£, 

where 9^^\ 1 < ^ < ft are independent with respective laws that of T^l, 1 < i < b{'K'). 

Proof. The only subtle point is that [A;] Plvrj = tt^, 1 < i < 6, since the labeling of the blocks of vr, vr' 
could differ. But since these partitions are respectively of [n] and [k], this cannot be the case. □ 



4 Proofs of Theorems [T] and [2] 

Let q = {qn,n > 1) be a sequence of laws on Vn respectively, that satisfies ([T]) and (H), for some 
fragmentation pair (—7, i') and some slowly varying function i. In order to lighten notations, we 
let On = n'^i{n). 

As we noticed in the introduction, it is easy to pass from the situation where gi(0) = 1 to the 
general situation, by adding independent linear strings with Geometric((7i(0))-distributed lengths 
to the n leaves of Ta- Since geometric distributions have exponential tails, the longest of these n 
strings will have a length at most Clogn with probability going to 1 as n — )• 00, for some C > 0. 
If we let be the tree for which qi{(l>) > and T^ the one for which qi{(l>) = 1, coupled in the way 
depicted above, we easily get, for any 7 > 0, 

F{dGMa-'Tla-'T^) < Ca;,'logn) 1. 

n— >oo 

Thus, we can deduce the convergence in distribution of a^^T^ to Ty^u from that of a^^T^. Therefore, 
from now on and until the end of the present section, we make the following hypothesis, which will 
allow us to apply Lemma [6j 

(H') The sequence {qn,n > 1) satisfies (H) and (/i(0) = 1. 



4.1 Preliminary convergence lemmas 

We now establish a couple of intermediate convergence results for the discrete model. Recall that 
the sequence of distributions qn,n > 2 on Vn respectively induce distributions pB onVs for finite 
B by formula (13). By convention we set pn = P[n]- 

Lemma 7. Let k > 2 and let vr' be an element in V^k] with b blocks, b > 2. Let g : (0, 00)^ — )• M 6e 
a continuous function with compact support. Then, under assumption (H'), 

anPn(yg{^, . . . , ^) lW[fe,=^'}) K^(dvr)5(|vri|, . . . , |vrb|)l|^|j^j=^,| , 

where Ku is the paintbox construction associated with u. Note that on the event {vr|[fc] = vr'}, the 
quantities ^vrj/n and |vrj| for 1 <i <b that appear above are a.e. non-zero, respectively under pn 
and Ky. 
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Proof. For simplicity, we let 



An = Pn 

By the definition of qn and Lemma [5| 



, . . . , 

n n 



{T|[fe)=^'} 



Now, the function 



ji,...,jft>i 

pairwise distinct 



pairwise distinct 



b 



h{s) 



ii ' 



s e 5^ 



h,...,ib>l 
pairwise distinct 



is continuous and bounded, because g is compactly supported in (0, oo)^, so that the sum is really 
a finite sum. Moreover, 



0<ki,k2,---<k ^^^-^ ■' j>l 
ki+k2+...=k 



(14) 



where K is an upper-bound of \g\, and for every A G Vn, it is easily checked that for large n, if 
e > is such that g{xi, . . . ,Xh) = as soon as mini<j<bXj < e. 



1 



k \k 



en 



h{\/n) < 



1 



E 



A,; 



_ g(^,.../-^ 

pairwise distinct 
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J \n — k 



h{X/n) 



Letting n — )• oo and applying (H'), which is validated by (14), 



lim a„A„ = / v{ds)h{s)=j K;^(d7r)g(|7ri|, . . . , |7r;,|)l|^L 
J Si JVf, " 



the latter equality being a simple consequence of the paintbox construction of Section 3.1.2 □ 
No w, we a ssociate with {q-mn > 1) a family of process (Jl^{r),r > 0) with values in Vb, as in 



Section 



3.2.4 



We let n" = W""' for simplicity, and set 

D," = D\i" = inf{r >0:[k]n n"(r) / {[k]}} 



for 2 < A; < n, and D"^ = L>J\'J = inf{r > : {1} G n"(r)}. Also, for r > we let 



X„(r) = #n^,)(r). 



Lemma 8. Let n. A; G N be fixed, with n > k > 2, and let vr' G V^k] have b > 2 blocks. Let F, f be 
measurable non-negative functions. Then 



E 



• Apf,] - l)))/(#nr(Z)p,]),l < i < 6)l|[fc]nnn(Dp,,)=.'} 



E^ 

r'>0 



(X„(r' - 1) - l)fe_i 



(n-1) 



fc-i 



F(X„(- A (/ - 1))) Px^ir'-l){f{#^^^, l<i< b)l{^. ,y) 
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Proof. We first consider an expression of a more general form. For non-negative functions G, g, 
we have, using tlie Markov property at time r' — 1 in the second step, 



E 



G(n"(. A pf,] - i)))5(nf,)(i)p,] - 1) nn"(Z)p,])) 

= ^ E[G(n"(. A (/ - l)))l|[fc]cn^^^(,,_i)|<7(nf,)(r' - 1) n n"(r'))l{[fc]nn"(r')^{[fc]}} 

r'>0 

^ E[G(n"(- A (/ - l)))l{[fc]cn^^j(r'-i)}Pn^jj(r'-i)(5(vr)l{[fc]n7r^{[fc]}}) 



r'>0 



Speciahzing this formula to G depending only on X„ and g^n) = /(#vri, . . . ,^TTb)l^^\^^^^^fy, and 
using obvious exchangeability properties, we obtain 



E 



F{Xn{- A (Z)f,] - l)))/(#nni?ffc]), 1 < i < 6)l{[fc]nn"(Dp,,)=.'} 
= e[f(X,(. a (r' - 1 < ^ < &)l{^lw=^'})l{Wcnj\)(r'-i)} 



r'>0 

All the terms in the expectation depend on (X„(r),0 <r<r' — l), except the last one which is a 
function of n"-^^(r' — 1). But by Lemmajlj (in fact, the variant used in the proof of Lemma jsj), 

F{[k] C U^^ir' - 1) I (X„(r),0 < r < r' - 1)) = (^-{r' -^l)- Ih-i ^ 

[IT' ^)k—l 

giving the result. □ 
In the sequel, n(-) will denote a continuous-time self-similar fragmentation with characteristic 



pair (—7, u), and Dk, k > 1 will be defined as in Section 3.1.5 
Lemma 9. Under assumption (H'), it holds that 



Xnilant]) 



t > 



n 



id) 



(|n(i)(t)|,t>o), 



in distribution for the Skorokhod topology, jointly with the convergence 

Proof. For n > A; > 1, let Pn^k = IF'(-'^n(l) = k). Note that the process X„ is a non-increasing 
Markov chain started from n, with probability transitions < J < Then by a simple 

exchangeability argument. 



Pn,k= Pn(7r)mfc(7r)- = g„(A)mfe(A)-, 1 < A; < 



n 



where mkin) = mfc(A(7r)) is the number of blocks of vr with size k. Consider the associated 
generating function for x > 



AePn fc=i 



k=l 



n 



Ai\^+i 

n . 



Hence, 1 - = where /(s) = 1 - E.>i s^'^ and by (H'), 

a„(l-F„(x)) ^ / (1- j;<'+i)Kds). 
■'^ i>\ 
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This is exactly what we need to use [25^ Theorem 1], stating that (n~-^X„([a„tJ),t > 0) converges 
in distribution to the self-similar Markov process exp(— ^^(.■)), as defined around Proposition [Tj 
Moreover, this convergence holds jointly with the convergence of absorption times at 1, so a~^D^ 
converges to the absorption time at of exp(— ^^(.)). By Proposition [Tj the process exp(— ^^(.-j) has 
same distribution as (|n(x)(t)|,t > 0), which reaches for the first time at time Di. Hence the 
result. □ 



Finally, the combination of the last two lemmas gives the last of our preliminary ingredients. 
Lemma 10. The following joint convergence in distribution holds: 



Dl 



[A;] nn"(Z) 



n 



Dk,[k]nU{Dk),m,){Dk)li £ [k]) 



Proof. Let vr' G Vk have b > 2 blocks, and f,g : (0,oo) ^ R,h : {0,oof 
functions with compact support. Then by Lemma [8l 



be continuous 



V\a„)-'\ n I Vj)C„(OJ-1)' " " / 



{[k]nU"{D'^)=7r'} 



r'>0 



r 

an 



{Xn{r' - 1) - l)k-l f Xn{r'-l) 
V n 



(n-1) 



.Xnir' - 1) 



1< i < Ml 



-Y.fi- 



E 



' r'>0 



Hn,Xn{r' -l))g 



n 



'iXn{r' - 1)) 



duE 



where 



and 



^>(n,X„([a„nJ - l))^ 
ix - l)k-l an 



Xn{[anU\ - 1) 



n 



^>{Xn{[anU\-l)) 



$(n, x) 



— l)fc-l ax {n,x/n)^{co,c) 



1< i < 6 1 



m 



--n'}) ■ 



Note that the Potter's bounds for regularly varying functions |1H Th.1.5.6.] imply that < 
n > X > A for some finite positive constants C, A. In particular there exists 
some no such that sup„>„g o<x<ra ^{n,x)g{x/n) < oo (since g is null in a neighborhood of 0). The 
joint use of Lemmas [7| and [9] entails by dominated convergence that the expectation term in the 
integral converges to (note that the quantities |vrj|, 1 < i < b are all a.e. positive on {vr|[,^.] = vr'} 
under Ki^) 

E[|n(i)(n)|'=-i-M|n(i)(tx)|) / Ac,(d7r)Mk.|,i<^<&)iH„,=.'} 

and since f,g, h are compactly supported, the whole integral converges to 

/ f{u)duE |n(i)(7.)|^-i-^5(|n(i)(^x)|) / ^M^Mimii < i < b)i{^~ ,y 

Jo L JVn 

which, by Proposition [6j equals 

E[/(zj,)ff(|n(,)(D,-)|)^( 1 

It is now easy to conclude, since |nj(Z)fc)| > almost-surely. □ 



< i < 6 1 



L{[fc]nn{Dfc)=7r'} 
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4.2 Convergence of finite-dimensional marginals 

The first step in the proof of Theorem [l] is the fohowing result on reduced trees Tq of Section 



3.2.4 



Proposition 12. Let B CN be finite. Under assumption (H'), we have the following convergence 
in distribution in 

an ' ' n-^oo 

Proof. We use an induction argument on the cardinahty of B. For B = {i}, one can assume by 
exchangeability (as soon as n > i) that B = {!}, and in this case, the reduced tree is T^^j = (0, D^^ 



while 7^(7^,^, {1}) = {2S,Di) by Proposition 10 By the second part of Lemma [9| under (H'), it 
holds that 

an n— 5>oo 



This initializes the induction. Now, assume that Proposition 12 has been proved for every set B 
with cardinality at most A; — 1, for some k > 2. We want to show that the same is true of any set 
of cardinality [k], and by exchangeability, we may assume that B = [k]. 

We now recall, using Proposition 11, that conditionally on D'^ = i, [k] n n"(Z)^) = vr' having 

6 > 2 blocks and on n"(L>^) = Hi,! < i < b with respective cardinality #7rj = Hi, the tree T^^^ has 
same distribution as 

(0«,l<z<6),, 

where 9^^'^ has same distribution as T^^, and these trees are independent. 

The joint distribution of D^, [A;] n n"(L»^), (#n^.)(D^), 1 < i < A;) is specified by Lemmajsj and 
its scaling limit by Lemma [T0| We obtain by the induction hypothesis that jointly with the above 
convergence, conditionally on [A:] n ir^{D^) = vr'. 



n—^oo 



Ui{Dk)\^T^'\ l<i<b, 



where the T^'^ are independent with same laws as TZ{Ty^,y, tt^) respectively. Finally, a^^Tj-^ converges 
to 

{mDk)rT^'\l<i<b)n,, 



and the M-tree associated with this tree has same distribution as 7^(7^^^, [A;]) by Proposition 10 □ 
4.3 Tightness in the Gromov-Hausdorff" topology 

We now want to improve the convergence of Proposition [12] into a convergence of non-reduced trees 
for the Gromov-Hausdorff topology. Namely 

Proposition 13. Under hypothesis (H'), we have the convergence in distribution 

1 ^ (d) ^ 

-"^n ' '1,1' 
an n->oo 

in 3" , for the Gromov-Hausdorff topology. 

This will be proved by first showing a couple of intermediate lemmas. 
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Lemma 11. Under assumption (H'), we have the convergence in distribution: 



(n"(MJ),t>o) ^ m),t>o) 

jointly with 

" ,t>o) ^ (|n(.)(t)M>o) 

for every i>l, all these convergences holding jointly. 

Proof. The fact that ( [/c] n ( [a„tj ) , t > 0) converges in the Skorokhod space to {[k]nU{t),t > 0) 
for every > 1 is obtained by using an inductive argument similar to that used in the proof of 



Proposition 12 We only sketch the argument. The statement is trivial for = 1, so we can assume 



that k >2. The process [A;] nn"([a„-J) remains constant equal to [k] up to time a^^D^, and jumps 



to the state vr' = [A:] n vr, vr = n"'(L>^). By Lemma 10, a^^^D"^ — t- as n — )• oo, and the latter has 
a diffuse law by Proposition |6j 



After time a^^^D"^, given n, the restrictions vr^ n n"'([a„-J + D'^) have same distribution 



as 



TT^ n n'^*([a„-J), and are independent. By the induction hypothesis and exchangeability, still con- 
ditionally on vr, this converges to vr^ n nW(-), where U^'\i > 1 are i.i.d. copies of 11. Moreover, 
since the jump times have diffuse laws, two such copies never jump at the same time, from which 
one concludes that given vr, the process (vr^ n n'^([a„tj + i^^),! < i < 6(7r'),t > 0) converges in 
the Skorokhod space to (tt^ R nW(t), 1 < i < b{7r'),t > 0). This concludes the inductive step by 
plugging the initial constancy interval of the process, with length a~^D^. 

The convergence of n'^([an-J) in the Skorokhod space follows, because dp{[k] n 7r,7r) < e^'^ for 
every n GVfq. This shows that [k] n n"([a n-\) remains uniformly close to n"([an'J). 

Next, by Lemma [2j it follows that, jointly with this convergence, for every i > 1, the finite- 
dimensional marginals of (n~^#n".^([antj),t > 0) converge in distribution to those of (|n(j)(i)|,t > 
0), at least for times which are not fixed discontinuity times of the limiting process — the set of 
such points is always countable, and it turns out that there are none in the present case. Since 
we also know that the laws of the processes (n~"'^#n".^([antj),t > 0) are tight when n varies, by 
Lemma [9] (these processes all have same distribution as (n Xn{[ant\),t > 0) by exchangeability), 
this allows to conclude. □ 



For + 1 < i < n, let 

5r = inf{r>O:[fc]nnf,)(r)=0}, 

the first time when the ball indexed i is separated from the k first balls. The random variables 
Sf,k + l<i<n have same distribution by exchangeability. The strong Markov property at the 
stopping time 5" also shows that conditionally on n"^-|(5") = B, the process {B nll^ {S f + r),r > 0) 
has same distribution as 11^. The tree tBnU"{S"+-) has thus same distribution as Tb, and can be 
seen as a subtree of T[„], characterized by the fact that this subtree contains the leaf labeled i, does 
not contain any of the leaves labeled by an element of [k], and is the maximal subtree of T[„] with 

Ik] 

this property. In particular, the Gromov-Hausdorff distance between Tj^j and Tj„] is at most 

dGKiTj^],T[^]) < ^ max^ht (tn^^^(s-)nuHsr+-)) ' 

where ht(t), called the height of t, is the maximal height of a vertex in t. 

Note that if j G n|?.)(5f ), then = S^. Therefore, the blocks n^.)(5f ), fc + l < i < n are either 
disjoint or equal. Moreover, the partition vr of [n] \ [k] with these blocks is clearly exchangeable. 
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By putting the previous observations together, we obtain by first conditioning on vr, and for every 
r/ > 0, 



dGH(Tg,T[„]) >r/aj <E 



i>l 



7? > 0. 



(15) 



At this point, we need the following uniform estimate for the height of a P„-distributed tree, which 
is the key lemma of this section. 

Lemma 12. Assume (H'). Then for all p > 0, there exists a finite constant Cp such that: 

I \ C 

P^ht>xa„j<^, Vx>0,Vn>l. 



Before giving the proof of this statement, we end the proof of Proposition 13 Using Lemma 12 



for p = 2/7 and (15), we obtain 



2/7 -, 



2/7 



By the exchangeability of the partition tt of [n] \ [k], note that for every measurable function /, 



nfi#Hk+i))] 



1 



n — k 



-E 



i=k+l 



E 



^-^ n — k 



vr,; 



This finally yields 



2/7 



2/7 -1 



Since the sequence [a^^n > 1) is strictly positive and regularly varying at 00 with index 1, 
we get from the Potter's bounds ([HI Th.1.5.6.]) the existence of a finite constant C such that 
{a]J'^ k-^) / {aT' n-^) < C^/kJ^ for all 1 < /c < n. Hence, 



P (dGK{Tl^lT[^]) > van) < CC2/^r/-2/^E 



n 



Note that the quantity in the expectation is bounded by 1. By Proposition [IT} it holds that 
S^j^i/an Sk+i in distribution as n — )• 00, where S^+i = \ni{t > : [fc] n n(fc^x)(0 = ^} ■ This 



convergence holds jointly with that of (n-l#^?^^^^([a„^J), t > 0) to {\Ii(^k+i){t)\,t > 0) in the 



Skorokhod space, whence we deduce that 

hmsupP (dGn{T}^],T[n]) > mn) < CC^/^rj-^^m \J\U^k+i){Sk+i-)\ 



Let S'j^^^ = inf{t > : {2, 3, . . . , A; + 1} n n(i)(t) = 0}, then by exchangeability, 
hmsupP fdGH(TS,rr„,i) > van) < CC,/^v-^/m \J\Ua){S'-)\ 



Since the quantity in the expectation goes to a.s. as /c — )• 00 and is bounded (indeed 5^ t -^{1} 
a.s. and n(i)(Z)|i}.— ) = by ([2]) and Proposition [7]), we conclude that for every r/ > 0, 

(16) 



[k] 



lim limsupP ^^GH(T'^^^ Tr„|) > 7?a„ = 



It is now easy to conclude from this, Proposition 4.2 and the fact that TZ{T^^y, [k\) converges is 
distribution in (=^,(iGH) to T-f,u as A; — )• 00 ([21]), using [lOl Theorem 4.2]: 
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Lemma 13. Let Xn, X, X^, X'^ be random variables in a metric space {M,d). We assume that for 
every k, we have X^ — X^ in distribution as n ^ oo, and X^ X in distribution as k ^ oo. 
Finally, we assume that for every r] > 0, 

lim limsupP(d(X^,X„) > r?) = . 
Then Xn X in distribution as n ^ oo. 



Proof of Lemma \12\ Note that if the statement holds for some p > 0, it then holds for all 
q G {0,p). We can therefore assume in the following that p > I/7 and we let e > be so that 
p{j — e) > 1. The main idea of the proof is to proceed by induction on n, using the Markov 
branching property. We start with some technical preliminaries. 

• First note that En[ht'^] < 00 for all r > and all n > 1. This can easily be proved by induction 
on n (r being fixed) using the Markov branching property, and the facts that qni{n)) < 1 and that 
Hi = 1 almost-surely. 

• Second, we replace the sequence {an,n > 1) by a 'nicer' sequence {dn,n > 1) such that dn ~ an, 
i.e. an/dn — t- 1 as n — t- 00 (this step is trivial when a„ = n'^; we then take a„ = an). Since 
{an,n > 1) is regularly varying at 00, it is well-known (see [Hi Theorem 1.3.1]) that it can be 
written in the form 

On = c(n) exp e{u)du/u^ , n > 1, 
where c(n) — 7>c>0asn— )-oo and e is a measurable function that converges to as cxd. Define 

dn = cexp e{u)du/ii^ , n > 1. 

We claim that there exists an integer > 1 such that for n > n^, 

^< f-V ' Vl<fc<n. (17) 
an \nj 

Indeed, let ttg be such that |e(ti)| < e for all u>Us. For n > k > u^, we have 



<e / du/u < eln{n/k) 



hence 



-e{u)du/u 



— = — exp — / eiujdu u < — 
On \nj \ Jk J \n. 



Besides sup^g|;^ j^^^jj. afc/c"^ '^/(a^n^ ''') < 1 for all n large enough (say n > n'^). Hence d^/dn < 
{k/ny~'^ for all n > = max(n£,ne) and all 1 < < n. 

• Since dn > for all n > 1 and dn ^ an, there exists some c > such that > ca„ for all n > 1. 
It is therefore sufficient to prove the existence of a finite Cp (a priori different from the one in the 
statement of the lemma) such that 

C 

PI (ht > xdn) < Vx > and n > 1, (18) 



to finish the proof of the lemma. In order to prove (18), we will use the integer introduced 
around (17) and we will further assume, taking rig larger if necessary, that a„ > 1 for every n > n^. 
Introduce now < Cp < 1 such that 

(1 - u)-P < 1 + 2pu, VO<n<C^. 
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Using (H') and the fact that qn{{n)) < 1 for all n > 1, there exists also > such that 

> C^, Vn > 1 (19) 



(recall that (7 — e)p > 1 and a„ > for all n > 1). Last we let 



Cp(n,):= max (E^iht^^JK) < 00 

l<n<nE 



and we set 



Cp := max (Cp(n,), (l/C^, 2p/C2)^) < 00. 
Our goal is to prove by induction on n > 1 that 

C 

P«(ht < xdn) >l-^, for every x > 0, (A„) 

Clearly, (A„) holds for all n < since Cp > Cpijie) and Pn(ht > xa„) < En[ht^]/(xa„)P. Now 
assume that (A^) is satisfied for all < n — 1 for some n > n^. For all < x < Cp^^ , the expected 
inequalities in (A„) are obvious, so it remains to prove them for x > Cp^^ . To get (A ji), we will 
prove by induction on i E N that 

P^(ht < xdn) > 1 - % , for every x £ ( 0, ^ ) , (A„,i) 
xP \ anj 

which will obviously lead to (A„). Note first that (A„^i) holds since 1/dn < 1 < Cp^^. Assume 
next that (A„^j) is true, and fix x G (0, {i + l)/a„). Using the Markov branching property and the 
fact that (Afc), holds for every k < n — 1, as well as (A„^j), we get 

P(A) 

p^(ht < xdn) = Yi ^'^(^^ n ^i^^^ < - ^) 

PW / r nP \ 



pW r< nP 



IP 



using the notation r+ = max(r, 0) and that for all sequences of non-negative terms bi^i > 1, 

riilLiCl ~ > 1 — Si^Li^i) for every m > 1. We can assume that x > C^^ since (A„) holds 
otherwise. In particular, xdn > x > 1/Cp > I. Therefore, 

1 _ 1 ^ 1 + 2p/{xdn) 



(xdn - 1)P {xdn)P{l - l/{xdn))P ~ (xa„)P ' 

and then 

P^(ht < xdn) 

> E -(^) - ^ E -«E m - Mt, E '"<^>i; d^' 

AeP„ AeP„ i=i AePn j=i 

AeP„ i=i AeP„ j=i 



by ((Tg 

^ - ^ - 1 E .»(^) (- E (^)'-^' j - E .»(.)E (^)'-'" 

AeP„ \ i=i / " xeVn i=i 
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We then use (19) and the fact that Yli^ii^i/'^)''^'''^'^ < 1 (since (7 - e)p > 1) to get 

P^(ht<xa„)>l-% + ^fc,2-^ 
xP xPan \ ^ X 

By assumption, x > Cp^^ > 2p/Cp, hence 

P^(ht < xcLn) > 1 ^ for every x G ^0, — 

as wanted. □ 



4.4 Incorporating the measure 

We now finish the proof of Theorem [T| by improving the Gromov-Hausdorff convergence of Propo- 



sition 



13 to a Gromov-Hausdorff-Prokhorov convergence, when the uniform measure i^n = IJ-dTn on 
leaves is added to T„ in order to view it as an element of ^ rather than c"^. 

We win use the fact |Z2t Lemma 2.3] that the convergence in distribution of a~^r„ as n — >■ 00 
in ^ entails that the laws of the random variables a~^Tn, form a tight sequence of probability 
measures on Therefore, it suffices to identify the limit as 7^,i/. 

So let us assume that a~^T„ converges to {T' ,d' , p' , G in distribution, when n — )• 00 
along some subsequence. Let L",L2, . . . be A; i.i.d. uniform leaves of T„. Conditionally given 
the event that these leaves are pairwise distinct, which occurs with probability going to 1 as 
n — )■ 00 with k fixed, these leaves are just a uniform sample of k distinct leaves of r„, so by 
Lemma |6] and exchangeability, the subtree of T„ spanned by the root and the leaves L", . . . , has 
same distribution as T^^. By Proposition |l2| we know that o~^Tj'^j' converges in distribution to 
n{T,,^, [k]) in^. 

A fc-pointed compact metric space is an object of the form {{X, d), xi, . . . ,Xk) where {X,d) is 
a compact metric space and xi, . . . G X. The set of /c-pointed metric spaces can be endowed 
with the A;-pointed Gromov-Hausdorff distance 



"gh 



i{{X,d) , Xl , . . . , X]( , x'l, . . . , x'jA ] — inf max dist((^(a;j), 0'(x^))VdistH(0(^), j 

\ / l<i<k 



where, as in the definition of the Gromov-Hausdorff distance, the infimum is over isometric em- 
beddings (j),4>' of X,X' into some common space (M, dist). Note in particular that c^q^ = c^gh- 
Now, the fact that a~^T„ converges to {T' ,d' , p' , p,') in ^ implies that the k + 1-pointed space 
{a~^Tn, p, Li, . . . , L^) converges in distribution to (T', p' , Li, . . . , L^), where Li, . . . ,Lk are i.i.d. 
with law p' conditionally on the latter. See [30l Proposition 10] for a proof and further properties 
of the fc-pointed Gromov-Hausdorff distance, which in particular is Polish. 

If (T, d, p) is a rooted M-tree and xi, . . . , x^ £ T, the union of geodesies from p to the Xj's 

k 

R{T,xi,...,Xk) = [J[[p,Xi\] 

i=l 

is in turn an M-tree rooted at p with at most k leaves, called the subtree of T spanned by xi, . . . , 
(the role of the root being implicit). 

Lemma 14. Let {An,dn, Pn),n > 1 be a sequence of rooted M-trees and x", . . . ,x^ be k points in 
An, such that {{An, dn), Pn,Xi, . . . jx'^) converges for the k + 1-pointed Gromov-Hausdorff distance 
to a limit {{A, d), p,xi, . . . ,Xk)- Then the subtree R{An, x", . . . , x^) converges in to the subtree 
R{A,xx, . . .,Xk). 
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We will prove this lemma at the end of the section. By using the Skorokhod representation 
theorem, we may assume that the convergence of {a~^Tn, p, L^, . . . , L^) to (T', p' , Li, . . . , L^) holds 



almost-surely. This, together with Lemma 14 and the discussion at the beginning of this section, 
implies the joint convergence in distribution in £^ of a~^T[„], a~^Tj'^j' to T',T^, still along the 
appropriate subsequence, where 7^' is the subtree of T' spanned by Li, . . . , L^. In particular, this 
identifies the law of 7^' as that of Tl{%/^u, [k])- When k — )• oo, we already stressed that the latter 
trees converge (in distribution in with the uniform measure on the set of its k leaves) to 
T-y^u- On the other hand, 7^' converges a.s. to {T" ,d' , p' , n') in as k ^ oo, where T" is the 
closure in T' of 

oo 



But the joint convergence of Tj„],Tj^j^ in ^ along some subsequence and (16) imply that for every 
r/ > 0, limf:^^F{dQu(Tk,T') > r]) = 0. So T" = T' a.s., entailing that {l\d\p' ,p!) has same law 
as T^^u- This identifies the limit of a~^Tn in as T^,u, ending the proof of Theorem nl 



It remains to prove Lemma 14, We only sketch the argument, leaving the details to the reader. 
We use induction on k. For k — 1, the subtree i?(^n, Xi) is isometric to a real segment [0, dn{pm ^i)] 
rooted at 0. The 2-pointed convergence of {{An, dn), Pn, Xi) to {{A, d), p, xi) entails that dn{pn, Xi) 
converges to d{p, xi), hence that R{An, Xi) converges to [0, d{p, xi)] rooted at 0, which is isometric 
to R{A,xi). 

For the induction step, we use the general fact that if ^ is a rooted R-tree and xi, . . . , Xk, Xk+i G 
A, then the distance between x^+i and the subtree of A spanned by xi, . . . , x^ is equal to 

. /d{xk+i,Xi) + d{xk+i,p) - d{xi, p) 

dk+i = mm 

i<i<k \ 2 

Moreover, if i S {1, 2, . . . , A:} is an index that realizes this minimum, then the branchpoint x^+i Axj 
is at distance 5k+i from Xk+i and is the ancestor of Xj at height (i.e. distance from p) 

hk+i = d{p, Xfc+i) - 5k+i ■ 

Roughly speaking, we get that R{A, xi, . . . , x^+i) is obtained from R{A, xi, . . . , Xfc) by grafting a 
segment with length 6k-\-i at the ancestor of Xj with height hk+i- 

In our particular situation, and with obvious notations, we get that R{AmX^, . . . ,x^^-|^) is ob- 
tained by grafting a segment with length to the ancestor of x^ with height h^_^^, where in is 
some index in {1, . . . ,k} that can depend on n. Up to extracting, we may assume that in — z is 
constant. The A:+2-pointed convergence of {{An, dn), Pn, Xi, . . . , x^_^^) to {{calA, d), p, xi, . . . , Xk+i) 
entails that the d^-distances between elements of x", . . . , x^_^^} converge to the correspond- 
ing d-distances of elements in {p,xi, . . . ,Xfc}. Consequently, it holds that S]^__^^, h'^j^-^ converge to 
6k+i, hk+i, defined as above. Together with the induction hypothesis stating that R{An, x", . . . , x^) 
converges in £^ to R{A, xi, . . . , x^), this entails easily that R{An, x", . . . , x^_^^) converges in ^ to 
the M-tree obtained by grafting a segment with length 6k+i to the ancestor of Xj with height h^+i 
in R{A, xi, . . . , Xk), and this tree is R{A, xi, . . . , x^+i). The result being independent of the par- 
ticular value of i (selected by the choice of an extraction), the convergence holds without taking 
extractions, which concludes the proof. 

4.5 Proof of Theorem [2] 

To pass from trees with n vertices (with law Q^) to trees with laws of the form Pn , with n leaves, 
we introduce a transformation on trees, in which every vertex which is not a leaf is attached to an 
extra "ghost" neighbor, which is a leaf. 
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Precisely, if t is a plane tree, then the modification t° is defined as 

t° = tU \J {{ui,...,Uk,Cuit) + l)}. 

u={ui ,...,Uk)&t\dt 

If we are given a tree rather than a plane tree, then this construction performed on any plane 
representative a the tree t will yield plane trees in the same equivalence class, which we call t°. 
Note that 

#at° = #t. 

We see t° as an element of (endowed with graph distance and uniform distribution on dt°), 
and view t as an element of by endowing it also with the graph distance, but this time, with 
the uniform distribution /zt on t. It is easy to see, using the natural isometric embedding of t into 
t°, that for every o > 0, 

dGHp(at,at°) < a. (20) 



Let {qn,n > 1) be, as in Section 1.2.2, a family of probability distributions respectively on Vn, such 
that qi{{l)) = 1. We introduce the family q^,n > 1 of probability measures respectively on Vn by 
gf(0) = 1, and 

C+i((A,l)) = (?n(A), n>l,XGVn, 

where (A, 1) = (Ai, . . . , Ap(A), 1) G Vn+i- 

It is then immediate to show by induction that if Tn has law Qn , then T° has law Pn , with the 
notation of Section 1.2.1 We leave this verification to the reader. In view of this and (20), we see 
that Theorem [2] is a straightforward consequence of the following statement. 



Lemma 15. // {qn,n > 1) satisfies (H) with either 7 G (0, 1), or 7 = 1 and i{n) as n 
then (g°, n > 1) satisfies (H), with same fragmentation pair (—7, i') and function I. 



00, 



Proof. Let / : 5^'' — )• M be a Lipschitz function with uniform norm and Lipschitz constant 
bounded by K. Let also g{s) = (1 — si)/(s). Then 



(A,l) 
n+l 



n 



P(A) 



< 



A,; 



+ 



K 



so that 



1 



^ n(n +1) n+l 
(A,l 



< 



2K 



n 



1 ' 



< 



< 



3K 
n + l 



Ai 
n + l 

KXi 



f 



+ 



n+l 
2K 



n 



n 



n{n + 1) n + l 



multiplying both sides by n'~'£{n), we see that the upper-bound converges to as n — )• 00 under our 
hypotheses. Since n'^i{n)qn{g) converges to v{g) by (H), we obtain the same convergence with g° 
instead of This yields the result. □ 

4.6 Proof of Proposition |4] 

Recall the notation K^^\n) for the decreasing sequence of sizes of blocks restricted to {1, . . . , n} of 
a random variable with painbox distribution ps(d7r), with s G 5^'-, Yl,i>i^i — 1- Recall also that 
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A.^^\n)/n s m almost-surely. Now set for A G Vn, 

g„(A) = n-^ [ P(A(^)(n) = A)l|„-,/2<i_.,}Kds), A / (n), 



Qn{{n)) = 1 - ^ qnW- 
Ae-P„,A7^(n) 

For n large enough, say n > uq, 

< g„(A) < /" (1 - si)i/(ds) < 1, 

hence qn defines a probabihty distribution on Vn such that qniin)) < 1. Then set qn = (Jn for 
n> no and for n < uq let be any distribution on Vn such that qn{{n)) < 1. 
Next, consider a continuous function f : ^ M4.. For n > ng, we have 



n^>^g„(A)(l-::^)/(^)=^^E 



, f . 

n \ n 



l{n-7/2<l-si}J^(ds) 



which converges to J^^ — si)i^(ds) as n —t- 00 by dominated convergence. This concludes the 

proof. 

5 Scaling limits of conditioned Galton- Watson trees 



Recall the notations of Section 2.1 While the probability distribution GW^ enjoys the so-called 



(n) 

branching property, it holds that the conditioned versions GW^ are Markov branching trees. 

Proposition 14. (i) One has GW^"''' = Qn for every n > 1, where the splitting probabilities 
q = {qn, n > 1) are defined by gi((l)) = 1 and for every n > 2 and A = (Ai, . . . , Ap) G Vn, 

nj>ii^jW- GW5(#t = n + l) 

(ii) On some probability space let Xi,X2,.-. be i.i.d. with distribution F{Xi = k) = 

GW5(#t = k), and set Tp = Xi + X2 + . . . + Xp. Then 



i{p{X) =p)= 



{Tp = n) 



P(ri = n + 1) ' 



and qn{- \ {p(A) = p}) is the law of the non-increasing rearrangement of {Xi, . . . , Xp) conditionally 
on Xi + . . . + Xp = n. 

Proof, (i) Under GW^ (viewed as a law on plane trees), conditionally on C0 = p, the p (plane) 
subtrees born from are independent with law GWg. For integers ai,...,ap with sum n, the 
probability that these trees have sizes equal to ai, ap is thus nf=i GW(^{#t = ai). Hence, 



GW--^(c. = p, #t. = a.,l<^<p) = ^(p) ^gf^^^^f^r;^ , (22) 



and conditionally on the event on the left-hand side, the subtrees born from the root are indepen- 
dent with respective laws GW^'^''',! < i < p. Letting A be the non-increasing rearrangement of 
(ai, . . . , Op) and re-ordering the subtrees by non-increasing order of size (with some convention for 
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ties, e.g. taking them in order of appearance according to the plane structure), we see that these 

subtrees are independent with laws GW^'^"'', 1 < i < p. Using the fact that there are p!/ Hj^i "T-il'^)' 
compositions (ai, . . . ,ap) of the integer n corresponding to a partition A G Vn, and viewing GWg 
as a law on T instead of plane trees, the conclusion easily follows. 

(ii) We have (?n(p(A) = p) = Gwl'^'^"^'' (c0 = p), and the wanted result is just an interpretation 



of (22). □ 



On the same probability space {Q,T,F) as in the previous statement, we will also assume that 
{Sr,r > 0) is a random walk with i.i.d. steps, each having distribution ^{i + > —1. Then the 
well-known Otter-Dwass formula (or cyclic lemma) [32\ Chapter 6] allows to rewrite 

,„(p(A) = p) = m ^^^^^^^ ^ ^ — «''> p(5„,. = -1) ' 

where ^(p) = pC{p) is the size-biased distribution associated with ^. 

It is often convenient to work with size-biased orderings of the sequence (Xi, . . . ,Xp) rather 
than with its non-increasing rearrangement. Recall that if {xi,X2, ■ ■ ■) is a non-negative sequence 
with Xj < oo, we define its size-biased ordering in the following way. If all terms are zero then 
we let = 0, otherwise we let i* be a random variable with 



and set = Xi*. We then remove the z*-th term from the sequence {xi,i > 1) and resume the 
procedure, defining a random re-ordering (x*,X2, . . .) of the sequence {xi,X2, ■ ■ ■)■ The size-biased 
ordering {X*, X2, ■ ■ ■) of a random sequence {Xi,X2, ■ ■ ■) is defined similarly, by first conditioning 
on {Xi,X2, . . .). If /i is the law of {Xi,X2, . . .), we let /x* be the law of (XJ',X|, . . .). 

If ^ is a probability distribution on S'^, then fi* is a probability distribution on the set Si = 
{x = {xi,X2 . . .) G [0, 1]^ : X^j>i Xi < 1} which is endowed with any metric inducing the product 
topology — in particular. Si is compact. Similarly, if is a non-negative measure on S\ we 
let fj,*{f) = /i(ds)E[/(s*)], for every non-negative measurable / : 5i — )• M+, where s* is the 
size-biased reordering of s. The following statement is a simple variation of ^ Proposition 2.3], 
replacing probability distributions with finite measures. 

Lemma 16. Let > 1 and ji he finite measures on S^ , and assume that fi is supported on 

{s£S^ ■.J2.Si = l}. Then fin converges weakly to /x if and only if /i* converges weakly to fi* . 

5.1 Finite variance case 

Here we assume that ^ has finite variance ^p>iP{p — l)?(p) = o"^ < 00. In the proofs to come, C 
will denote a positive, finite constant with values that can differ from line to line. 



Our goal is to check hypothesis (H) for the sequence q of (21 ), and for the measure v = {a/2)v2. 



Due to Lemma 16 , it suffices to show that 

n'l\{l - si)qMs)r H (a/2)((l - .i)z.2(ds))* . (24) 



Now, for any non- negative measure /i on S'^ and any non-negative continuous function / on 5i, one 
can check that 

((l-.OMds))*(/) 



/ /i*(dx)(l-maxx)/(x), (25) 
JSi 



where maxx = maxj>i Xi. Applying (25) to = g„ and 1^2, we conclude that (24) is a consequence 
of the following statement. 
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Proposition 15. Let f : Si 



be a continuous function and let g(x.) = (1 — maxx)/(x). Then 



dx 



jg{x,l- x,0,...) 



In summary, Theorem [3] in Case 1. is a consequence of this statement and Theorem [2] 



(26) 



Proposition 15 will be proved in a couple of steps. A difficulty that we will have to be careful 
about is that x i— )• maxx is not continuous on Si. Fix /, as in the statement. Note that < 
1 — maxx < 1 — xi for every x G Si, so that g{x) < C(l — xi) for every x G 5i for some finite 
C > 0, a fact that will be useful. 



First, note that combining (ii) in Proposition 14 with a size-biased ordering, it holds that 



C(5) = E'?"(?'(^)=^)^ 

p>i 



{X^, . . . , X*0, . . .) 



n 



(27) 



Lemma 17. For every e > 0, 



0. 



Proof. From (23), the local limit theorem in the finite- variance case 

n2 



sup 



n¥{Sn = -p) 



1 



■ exp 



\/27r cr2 

shows that qn{pW = p) < C^{p) for every n,p. Now 

^i{{k,cc)) < oo , 



P' \ 
2na'^ ) 



0, 



(28) 



fc>0 



because has finite mean. Since .^((A;, cxd)) is non-increasing, this entails that ^((/c,oo)) = o{k ^). 
Hence the result. □ 



Lemma 18. One has 



limlimsup\/ng*(|5r|l{^^>i_^}) = 0, 



and 



lim \/^g;^(l{^^<„-r/8}) = 0. 



Proof. Let e,?7 > 0. Since |/(x)| < (1 — xi), we get using (27), that -^/nf/* (|/|l|^.j>i_^}) is 
bounded from above by 

1/2 I i\\ \ \^ (^ mi\ pmi¥{Xi = mi)¥{Tp_i = n - mi) 



l<p<eni/2 



mi>(l— 77)71 



because 



¥{X^ = m\Xi + ... + Xp = n) 



pm F{Xi = m)F{X2 + . . . + Xp = n - m) 



n 



F{Xi + ...+Xp = n) 



The 0(1) term accounts for the fact that we restricted the sum to 1 < p < en^/^, which costs at most 
o(n^^/^) by Lemma 17 We now use the cyclic lemma again, entailing that P(Tp = n) = {p/n)¥(Sn = 
—p). Using this for p = 1, and with the help of (28), we obtain that P(ri = n) ~ (cj\/27r)~^n~^/'^ 
as n — )• cxD. This yields the upper-bound 



l<p<£rai/2 (1— r;)n<mi<n 



n P(5, 



n—m-i 



p+l) 



+ 0(1), 
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and since I < p < en^^"^, (28) implies that ^/nF{Sn = —p) and -y/n — mi¥{Sn-mi = —p + 1) are 
respectively bounded from below and above, by constants that are independent on n,mi,p (but 
might depend on e). Consequently, the bound is 

P>1 (l-r;)n<mi<n Y ^ (^i - 

and this converges to Cg f^_^{x{l — x))^^^'^dx. In turn, this goes to as — )• 0, for fixed e. The 
second limit is obtained in a similar way, writing \/^9n(l{xi<n-7/8}) as 



n' ^ MX)=P) ^ +o(l) 

l<p<£ni/2 l<mi<ni/S 

< C,n-V4^p(p-l)e(^,)+o(l), 
P>1 

for some constant Cg, where we used the local limit theorem at the last step. □ 
Lemma 19. For every ij > 0, it holds that 

lim Vnq*ni^{xi+x2<i-ri}) = 0. 

Proof. Fix e > 0. An upper-bound for the quantity appearing in the statement is given by (up to 
an additional o(l) quantity depending on e) 

/- Sr^ ^ pmi (p - l)m2 P(Xi = mi)P(X2 = m2)P(rp_2 = n - mi - 

Vn 2^ p^[p) 2^ n-mi P(rp = n) ' 

l<p<eni/2 m\+m.2<[l—'q)n 

If nil + "12 < ri{l — rj) then n — nii — m2 > 'r]n. In this case, we obtain, using the cyclic lemma and 

^(^^-2 = n - mi - m2) _ „_mi-m2^*^^'^-'"l-"^2 



£=2 — _ _p + 2) ^ C 



P(Tp = n) £P(5„ = -p) - 773/2 • 

Note that the constant C here does not depend on p, e. Consequently, we obtain the bound 



rf'l'^^n — ' " ' ■ — ' V mi m2 

l<p<eni-/2 mi+m2<(l-»;)n 

Ce sr^ 2ci \ I dxid2;2 

where C is still independent of p, e. The first term on the right-hand side does not depend on n 
anymore and goes to as e — )• 0, entailing the result. □ 

Lemma 20. There exists a function = o{r]) as rj ^0, so that 

limliminf Vng;(5'l{xi<i-,,,xi+x2>i-/3„}) = ^^^^^^^^^P Vnq*ni9^{xi<i-r^,xi+x2>i-M) 

r;4,u n—^oo r/lO n— s>oo 

o- g{{x,l-x,0,...))^^ 



2-K Jo xV2(l _ 2;)3/2 
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Proof. The proof is similar to the previous ones, but technically more tedious, so we will only 
sketch the details. Fix r] > 0, and consider rj' S (0, ij) and e > 0. Then, by decomposing with 
respect to the events {p{X) > e^/n} and {x : xi < n~^/®}, we obtain, using Lemma |l7| and the 
second limit of Lemma flSl 



Vnq*n{9^{xi<l-r,,x^+X2>l-r,'}) = ^(1) + ^ QnipW = P) 

l<p<eni/2 

X X! ^g{{mi,m2,X^,...,X*)/n)\Tp = n,Xl =mi,X2 =1712] 

nl/*<mi<(l— »y)n 
{l—ri')n<mi+'m2<n 

n 1 — mi/n r{Tp = n) 

We now give a lower bound of the liminf of this as n — )> oo. Showing that the same quantity is an 
upper-bound of the limsup being similar and easier. 

Note that if X1 + X2 > 1 — r]' and xi < 1 — r/, we have that (1 — xi — X2)/(l — xi) < r]' /r] and then 
3:2/(1 — xi) > 1 — r]' /f]. Next, by the local limit theorem, we can always choose r]' small enough so 
that F(Xi = m2)/F(Xi = n — mi) > 1 — rj for every n large enough. 

Also, still by the local limit theorem, we can choose e small enough so that for every 1 < p < 
en}!'^ and every n large, we have 



g„(p(A) = p) li{p) > (1 - r/) and p"^n^/¥(rp = n) > (1 - r\)a^ . 

A third use of the local limit theorem entails that for every n large, we have 

= mi) A mj'^F{X2 = ^2) > (1 - 'n)aV2^ , 

for every n large and mi > n^^^, m2 > {n] — ri')n. 

Finally, we use the fact that / is uniformly continuous on Si, while maxx = xi V X2 on the 
set {x e 5i : xi + X2 > 3/4}. Consequently, the function g^x) = (1 — maxx)/(x) is uniformly 
continuous on the latter set. Therefore, we can choose < 1/4 small enough so that 

|5((mi,m2,m3, . . .)/n) - g{{mi,n- mi, 0, . . .)/n)| < r] 

for every (mi, m2, . . .) with sum n, such that mi-l-m2 > (1 — ??')n. Putting things together, for every 
r/ > 0, we can choose rj' =: /3,j, e small so that for every n large enough, y/nq^{gl^xi<i-ri,xi+x2>i-ri'}) 
is greater than or equal to 



[l-rif{l-r]'/ri) ^ (p-l)l(p)- J2 {g{{mi,n - mi,0, . . .)/n) - i]) 



mi 



n ^ — ' n 



1 



aV2TT{{mi/n){l - mi/n))3/2 



P(Tp_2 = - "li - m2). 

(1— »y')n— mi<m2<?i— mi 



Finally, the last sum is X^m=o^('^p-2 = this can be made arbitrarily close to 1, uniformly 

in 1 < p < en^/'^, as soon as n is large enough, by our usual use of the local limit theorem. Taking 
the liminf in n and using a convergence of Riemann sums, yields 

^^^^^^Vnq*n{9^{xi<l-rj,xi+X2>l-v'}) 

- f^~^ dx 

> (1 - Vfil - V'h) Y.(P - l)^(^) I .^,1/2(1 _,)3/2 (g(^' 1 - X, 0, . . .) - 
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One concludes using the fact that Ep>i(l' - l)C(p) = • □ 
We can now finish the proof of Proposition [Tsj Simply write 

iQnig) -Qn{9'^{xi<l-n,xi+X2>l''n'})\ < C(l5|l{xi>l-»7})) + Cdsl l{xi+X2<l-»?'}) • 

choose 7], T]' in such a way that -^/ng* (|(7|l|2,.j>i_^}) < 
^ g{{x,l-x,0,...)) 



Now fix e > 0, and using Lemmas 
e/2 and 



18 



and 



20 



^{xi<l—ri,xi+X2>l~'q' 



}) 



-dx 



/2ttJo x1/2(i _a;)3/2 
for every n large. For this choice of rj, r]' , we then have for every n large enough, 

"1 g{ix,l-x,0,...)) 



< e/2 



a 



2^ Jo Xl/2(1 _ a;)3/2 



-dx 



<£ + qn{\g\'\-{xi+x2<i-n'}) 



and the upper-bound converges to e as n — )• oo by Lemma [T9| Since e was arbitrary, this proves 
Proposition 15, hence implying Theorem [3] in Case 1. 

5.2 Stable case 

Assume that ^(p) ~ cp~"~^ for some a G (1,2) and c > 0. Theorem [3] in Case 2. will follow if 
we can show that hypothesis (H) holds for 7 = 1 — 1/a, I = (a(a — l)/(cr(2 — 0)))^^°' , and the 
dislocation measure . A similar reasoning as in the beginning of the previous section shows that 
it suffices to prove the following statement. 



Proposition 16. If f : Si 



is a continuous function and g(x.) = (1 — maxx)/(x), then 



n 



1— l/a— * 



r(2-Q)xi/" . 



a{a — 1) 



<{9). 



One will note that the function g of the statement is continuous z^*-a.e., since x 1— )• maxx is 
continuous at every point x with sum 1. Now, 



da) 



p>l 



9 



{X*, . . . , X*, 0, . . .) 



n 



n 



n 



9 



(Xi*,...,X*^i/„^^,0,...) 



n 



n 



(29) 



Recall the notation around ( 23 ) . The random walk Sn is now such that {S\r,t\ Z^^''", * > 0) converges 
in distribution in the Skorokhod space to a spectrally positive stable Levy process (li,t > 0) with 
index a and Levy measure cd2;/a;-'^"'""l|j,.>o}- Its Laplace transform is given by E[exp(— Al^)] = 
exp(te'A"), where c' = c |^|^~°| . The Gnedenko-Kolmogorov local limit theorem also yields 

ni/"P(5„ = k)= pi{k/n^l'^) + e(n, k) 



where sup^ |e(n, /c)| — )• as n — )• 00, and pt is the density of Yt. This, together with (23) and our 
hypothesis on the asymptotic behavior of ^, entail that 



qn{p{X) = \n^^°'x'\) ~ cn """x 



1 _„Pi(-a;) 



Pi(0) 



Let us now focus on the random variables Xi, X2, . . . and Tp = Xi + . . . + Xp. We have P(A'i 



n] = n 



-1 



-1) ~ n ^ ^/"pi(0), which gives that Xi is in the domain of attraction of a stable 



44 



random variable with index 1/a. More specifically, one has that (T|^nxJ 7''^° ) x>Q) converges in the 
Skorokhod space to a stable subordinator {Ty,y > 0) with index 1/a, and Levy measure 

dx 

^^i(0)^mA^l{->o}- (30) 

Its Laplace transform is given by 

E[exp(-AT,)] = exp(-xpi(0)ar(l - l/a)Ai/") . 

On the other hand, has same distribution as the first hitting time of —x by (It, t >0) (because 
a similar statement is true of Tp and S'„), which identifies the Laplace exponent of Ti as (A/c')^/°, 
see [71 Chapter VII]. This yields 

1 1 /a{a — \ 1/" 

" ar(l - l/a)(c')V" " ar(l - 1/a) \cT{2 - a)) ' ^^^^ 

Let Qy be the probability density function of Ty. The cyclic lemma [71 Corollary VII. 3] gives 
tQxit) = xpt{—x), while the Gnedenko-Kolmogorov local limit theorem states that 

p"P(rp = n) = gi(n/p") + e'ip, n) , 

where sup„ |e'(p, n) | — )• as p — )■ oo. 

Lemma 21. The sequence {X^, . . . , X'^^^/^^-^)/n conditioned on T|-„i/a^-| = n converges in distribu- 
tion to a random sequence (A^, Ag, . . .), defined inductively by 



\* r- A A* A* ST A* ^ PliO)x Qxil - y - Z) 

A,^,€dz A„...,A,,^A^=y)=-^^j^^—^dz, 0<z<l-y. 
Proof. The case i = 1 is obtained by using the local limit theorem in 



1/ IF'(T"r„i/cri_i =n - \nz\) 

nP(Xr = M I r^„v..i = n) = \n'/'^x] lnz\F{X, = M) - | • 

One then reasons inductively, in an elementary way. Details are left to the reader. □ 

The limiting sequence (A*,i > 1) has same distribution as the sequence of jumps of the sub- 
ordinator {Ty,0 < y < x), conditionally given Tx = 1, and arranged in size-biased order, see [32l 
Chapter 4] or [9]. We will denote by AT*^ this randomly ordered sequence of jumps. Hence, pro- 
vided we have the right to apply dominated convergence in (29), we obtain, using xpi{—x) = Qx{l), 

r r°° dr 

n'-"-tn{9) ^ / -^,Q.m[9{^^,x]) I Tx = l]- (32) 

Using scaling for the subordinator {Ty, y >0), the previous integral can be rewritten as 

^ f ^Qi(--°)Eb(-'^AI|*,,j) I T, = X-"] , 

and changing variables u = x~°' shows that this is equal to 

c c 
^ Q,{u)duE[ugiAT^,^,^/u) \ T, = n] = —E[T^g{AT^,^,^/T,)] . 
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Finally, the sequence AT[o^i] of jumps of T before time 1 is the sequence of atoms of a Poisson 
measure with intensity given by (30). Using (31), it thus has same distribution as a{a — l)c~^T{2 — 
a)^^(Ai, A2, . . .)) as defined in Section 2.1 Using the notations therein and (31), we get after 
rearrangements 



api(O) 



E[Tig(Ar[*^i]/ri)] 



r(2 - a)\^/o'a^T{2 - l/a) 



r(2 - 



-1) 

a) 

-11 



l/ot 



r(2 - a) 



E 



T ' 



i > 1 



as wanted. It remains to justify that the convergence (32) is indeed dominated. To this end, using 
(29) and the fact that qn{p{^) = \n^^°'x]) < C[n^/"x]~°, it suffices to show that the expectation 
term in this equation is bounded by C[n"^/"x] /n^^" for some C independent of n, and for x G [0, 1]. 
In turn, since gix) < C(l — xi) for some finite C > 0, it suffices to substitute this upper-bound to 
g. Now, we have P(Xi = m) < Cm~^~^^°' for every m, so that 



E 



n 



T|-„i/o 



m 



n 



< 



< C 



m=l 

n 

[ni/"x] 1 ^ 



n 



n 



m 



n 



l/a 



TTl 

x\—F(Xi 
n 



m) 



m] 



E 



n) 



-[nV^x] + 1) 



l/a 



< c 



n 



l/a 



where we have used that ( n — Tn^^^^W^l^Sn—m — — ["tT/^'^'^^x] ~t~ 1) is uniformly bounded (in tl^tti^x^ and 
that n^/"P(S'„ = — [n^/"x]) is uniformly bounded away from for x G [0,1]. This is the wanted 
bound, concluding the proof of Proposition [TBI hence of Theorem [3j 



6 Scaling limits of uniform unordered trees 

In this section, we fix once and for all an integer m £ {2, ... ,00} and consider trees in which 
every vertex has at most m children. We use the notations of Section |2.2| and let T„ be uniformly 
distributed in for n > 1. 

The first difficulty we have to overcome is that the sequence (T^, n > 1) is not Markov branching 



as defined in Section 1.2.2 We will therefore start in Section [6.1| by coupling this sequence with 



a family of Markov branching trees that are asymptotically close to T„,n > 1, and then check in 



Section 6.2 that the coupled trees satisfy (H). 



Let us fix some notation. For t G t!^\ we can write t = (t^^\ . . . ,t('^')) with Yli=i = n — 1, 
and we let A(t) G Vn-i be the partition obtained by arranging in decreasing order the sequence 
(#t(^), . . . , #1*^^^)) (of course, this does not depend on the labeling of the trees t^^\ . . . ,t^^^). Let 
fj{k) be the set of of multiset^ with k elements in T^™^. By convention, we set Fj(0) = {0}. 

^Recall that a multiset with k elements in some set A is an element of the quotient set A'^ /&k, where acts in 
the natural way by permutation of components. 
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Then, for A G Vn-i with p{X) < m, we have a bijection 



n-1 



{teJt^:X{t) = X}^llf,{mj{X)), (33) 

i=i 

obtained by grouping the subtrees of t born from the root with size j into a multiset, denoted by 
fj(t), of mj{\) trees. From this, we deduce that fj(T„),l < j < n — 1 are independent uniform 
random elements in Fj(mj(A)) conditionally given X{Tn). However, the uniform random element 
in fj{k) has a different distribution from the multiset induced by k i.i.d. uniform element in Tj'"^ 
as soon as A; > 2. This is what prevents T„ from enjoying the Markov branching property, i.e. from 
having law Qn, where for n > 1, qn is the law of A(r„_|_i). 
Letting Fj{k) = #F_,(/c), the previous bijection yields 

SW := #{t e Tt^ : A(t) = A} = J] F,(m,(A)) . 

j=i 

When p{X) > m, we set si"^^ = 0. Of course, letting T^'"'' = i^T^\ we also have 

^n — 2-^ "^n ■ 

Using the obvious fact that Fj{k) < T^™^Fj(A; - 1), we obtain the rough but useful bound 

S(A) < tJ^)s(a_.A3,...,a,(.,)_ (3^^ 

We recall the key result ^ of Otter |3T], which is used throughout the proofs below: 

rp(m) P 



3/2 ■ 



Setting Tq™^ = 1 by convention, we obtain that for p = pm > I, and two constants > 1 > k > 0, 

„n n 

T(;r^<K-^, n>0 TM>k4^, n>l. (35) 
Note that we also have T^™^ < Kp^ for all n > 0. Last, we let k = Km- 



6.1 Coupling 

Let be the uniform probability distribution over T^™\ and let qn = X^pn+i be the law of the 
partition of n induced by the subtrees born from the root of a ^n+i-distributed tree. For every 
n > 1, we want to construct a pair of random variables (T„, T^) on some probability space (fi, P), 
such that 

• r„ has law pn 

• has law 

• for every e > 0, lim„_^oo E [dcHP ("-"^T„, n^^T^)] = 0. 

Recall that if Tn has distribution and conditionally on A(T„) = A, then fj(r„), I < j < n — 1 
are independent, respectively uniform in Fj(mj(A)). We are going to need the following fact. 
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Lemma 22. For every j,k > I, let Fj be uniform in fj{k) and Fj be the multiset induced by 
an i.i.d. sequence of k random variables with law fj,j. Let Aj be the set of elements in fj{k) with 
components that are pairwise distinct. Then 

(i) One has ¥{Fj e Aj) < F(Fj e Aj). _ 

(ii) The conditional distributions of Fj and Fj given Aj are equal. 



Proof. For a finite set A, the number of multisets with k elements is ^{A'^ ^ i^A^/kl. Then 



#T;.'")(#Tf)-l)...(#T;.™)-A; + l) 



kl#fj{k) 



< 



#t(™)(#T;"^)-1)...(#T;.™)-A; + 1) 



('")^fc 



P(F, G A,) 



This gives (i). Property (ii) is also obtained by counting: on the event Aj. The probability that Fj 
equals some given (multi)set S G Fj(fc) with all distinct elements is ^fj{k)~^, while the probability 
that Fj equals the same set S is k\{#j'f'^)~'' . Dividing by F{Fj £ Aj) and P(Fj e Aj) respectively 
gives the same result. □ 

The previous statement allows to construct a coupling between Fj and Fj, in the following way. 
Let f G Fj (/c). Consider three independent random variables f", f", B, such that the law of f" is the 
law of Fj conditionally given Aj , the law of f is the law of Fj conditionally given A^ and B is an 
independent Bernoulli random variable with ¥{B = 1) = F{Fj £ Aj)/¥(Fj £ Aj), which is indeed 
in [0, 1] by (i) in Lemma 22 , Set 



if f G Aj 



f = <( f" if f ^ Aj and = 
if f ^ Aj and B = I. 

We let Kj{f, ■) be the law of the multiset f thus obtained, hence defining a Markov kernel on fj{k). 
We say that the random variables F,F' are naturally coupled if {F,F') has law fi{df)Kj{i ,df'), 



where ^ is the law of F on fj{k). Using (ii) in Lemma 22, it is then easy to obtain the next result. 

Lemma 23. If Fj is uniform in fj{k) and (Fj,Fj) are naturally coupled, then the law of Fj is that 
of the multiset induced by k i.i.d. uniform elements in Tj. 

Next, we define a Markov kernel K{t,-) on t(™\ in an inductive way. Let K{»,{»}) = 1. 
Assume that the measure K{t, •) on has been defined for every t £ T^™) with #t < n — 1. 

Take t G 'T^\ and let A = A(t),p = p(A). Let fj(t) G Fj(mj(A)), 1 < j < n — 1 be the multisets of 
trees born from the root of t, respectively with size j. Let f^ (t) be independent random multisets, 
respectively with law Kj{fj{t), ■). We relabel the p elements of the multisets f^(t), 1 < j < n — 1 as 
t(i), . . . ,t(p), in non-increasing order of size, so that #t(j) = Aj — if there is some j with mj{X) > 2, 
we arrange the trees with same size in exchangeable random order. All these trees are in T^™) 
and have at most n — 1 vertices. By the induction hypothesis, conditionally on this family, we can 
consider another family t^-,^^, . . . ,1^^^ of independent trees with respective laws -ftr(t(j), •). Let K{t, •) 
be the law of the tree (t(j)) 1 < ^ < p)- This procedure allows to define the Markov kernel K{t, ■) 
for every tree in T^*"^. 

We say that the random trees {T,T'), defined on a common probability space, are naturally 
coupled if the law of {T,T') is fi{dt)K{t,dt'), where fi is the law of T. Is is easy to see that for 
every random variable T on T^™') with law /x, then, possibly to the cost of enlarging the probability 
space supporting T, one can construct a random variable T' so that (T, T') is naturally coupled. 
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Proposition 17. Let have law and {Tn^T!^ he naturally coupled. Endow these trees respec- 
tively with the measures jiXn ^'^^ I^T^ ■ Then, 

(i) the tree has distribution Qn, for every n > 1, 

(ii) for all a > 0, the Gromov-Hausdorjf-Prokhorov distance between aTn and aT^ is at most 2aj* 
where j* is the supremum integer j >1 so that there exist two subtrees ofTn with size j, born from 
the same vertex and which are equal (with the convention sup0 = 0), 

(iii) for all e > 0, E [dQupin^^Tn, n~^T^)] — )• as n ^ oo. 

Proof. We prove (i) by induction. For n = 1 the property is obvious. Assume the property holds 
for every index up to n — 1, and condition on A(T„) = A, which by definition has probabihty gn-i(A). 



As noticed before Lemma 22 the multisets Fj = fj(Tn), 1 < i < ?^ — 1 are independent, respectively 



uniform in fj{mj{X)). Conditionally on Fj,l < j < n — 1, let < j < n — 1 be independent 



with respective laws Kj{Fj,-). By Lemma 23 we obtain that Fj is the multiset induced by a 
sequence of mj(X) i.i.d. random variables, with law fij. Consequently, if we relabel the elements 
of Fj, 1 < j < n — 1 as T(^i), ■ ■ ■ )2^(p) in non-increasing order of size (and exchangeable random 
order for trees with same size), then we obtain that these trees are independent, respectively with 
distribution fix ■ Since, by definition of K, the natural coupling {Tn,T^) is obtained by letting 

= (^(j))! < i < p) where (Tj-j) , T^'^^ are naturally coupled, we readily obtain the Markov 
branching property, with branching laws {qn,n > !)• 

For (ii), we again apply an induction argument. The statement is trivial for n = 1. Now, in the 
first step of the natural coupling, the action of the Markov kernel Kj on fj(r„) leaves it unchanged 
if fj(r„) G Aj, i.e. if there are no ties in the multiset fj(T„). Consequently, with same notations as 
in the previous paragraph, a subtree of born from the root that appears with multiplicity 1 will 
also appear among T(i), . . . , T(p). 

Moreover, subtrees that are replaced are always replaced by trees with the same number of 
vertices and a tree with j vertices and edge- lengths a has height at most aj. So the Gromov- 
Hausdorff-Prokhorov distance between two trees with edge-lengths a that both decompose above 
the root in subtrees of same size j is at most 2aj (it is implicit in this proof that all trees are 
endowed with the uniform measure on their vertices). We now appeal to the following elementary 
Fact. Let t,t' be such that k = p(A(t)) =p(A(t')), and let t = (t(i), . . . ,\k)) and t' = (t^^^, . . • ,t(^)) 
with #t(j) = #t|^^ for \ <i <k. Then for every a > 0, 

(iGHp(at, at') < mc^^dGHp(at(i),at(.)) . 

From this we deduce that the Gromov-Hausdorff-Prokhorov distance between aTn and aT^ is 
at most 

(2a sup{l <j<n-l:Fje A",}) V sup sup dGHp{aTu),aTL) , 

l<j<n-li:#T(,)=i 

where (T^^-^ , T^.-^) is the natural coupling. The induction hypothesis allows to conclude. 

Last, for (iii), fix e G (0,1). The Gromov-Hausdorff-Prokhorov distance between n~^Tn and 
n~^Tl^ is bounded from above by 2n^~^ for all n > 1. Next, for 7 G (0, 1), let AZ be the subset 
of trees of T^™^ that have at least two subtrees born from the same vertex that are identical, and 
with size larger than n"^. By (ii), when T„ ^ AZ, dQY{-p{n~^Tn,n~^T!^) < 2n'^~^. Hence, 

IE[dGHp(n-^r„,n-%)] = E [dGHp(n-^r„, n-%,)l^^^g^.|] + E [dGHp(n-^T„, n-^)!^,.^^^.}] 

< 2n^-'F{Tn £ Al) + 2n^-' . 



Taking 7 < e and using Lemma 24 following right below, we get the result. □ 
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Lemma 24. For 7 G (0, 1), let AZ be the subset ofT^^ of trees t that have at least one vertex v 
such that at least two subtrees born from v are equal and have at least n'^ vertices. Then, 

¥{Tn G Al) = 0(/>^"^ 72^/2) as 00. 

This lemma will be an easy consequence of the following result. For every tree t and any vertex 
V of t, we let t^"-* denote the subtree of t rooted at v. When v* is taken uniformly at random among 
the vertices of t, we set t*^*) := t^^*-*. 

Lemma 25. The distribution of Tn conditionally on i^Tn 
l<k<n. 

Note that the event {#T,1*^ 



k is uniform on t[™\ for every 



k} has a strictly positive probability for all 1 < A; < n. 
Proof. Let k£{l,...,n}. For ah to G tJ.™\ using that P(r„ = t) = for t G T^'"\ 



»(ri*) = to) 



teT 



(m) 



rp(m) 

J- n 



(m) 



r^*) = to|r, = t 
(t(*) = to) 



E 



This quantity is independent of to G tI™'', because there is an obvious bijection between the sets 



{(t,u) : t G J^"^\v G t,t(^) = to} and {(t,w) : t G T'n"',v G t,t^^^ = ti} for ti G T^"'^ Hence the 
result. □ 

Proof of Lemma 24| Let AZ{k) be the subset of trees of T^™'^ whose decomposition above the 
root gives birth to at least two identical subtrees with size greater than n'^ , k < n. We first give 
an upper bound for the probability F{Tk G AZ{k)). To do so, we bound from above the number of 
trees of T^*"^ that decompose in at least two identical subtrees of size i {i < {k — l)/2): there are 

xj™^ choices for the tree with size i appearing twice. Then, there are T^l^2i forests with k — 1 — 2i 
vertices. Gluing the twins trees and a forest with k — 1 — 2i vertices to a common root gives a tree 



-(m) 



-im) 



with 



with k vertices (and a root branching in possibly more than m subtrees) and all trees in T^ 
at least two subtrees with size i can be obtained like this. From this we deduce that the cardinal 
of AZ{k) is at most Yyi=n-< 

F{neAl{k))<- 



(k-l)/2 (m)rp(m) 
(fe-l)/2 



1 



,{m) 



. In particular, using (35) and the fact that p > 1, 



i=ni 



^ k i=ni 

where C is a generic constant independent of n and k < n. Now, in the following lines, given T„, 
we let f 1, f2, . . . , Vn denote its vertices labeled uniformly at random, 

n 



'(TnGAZ) 



< 



E 



nF fc*) G Ali#Ti*^ 



E 



.i=l 



by Lemma |25| 
< 



n 



k=l 



'{n&Al{k))¥{i^T^*^ = k) 



k=l 



□ 



50 



6.2 Hypothesis (H) and conclusion 

It remains to check that the family of probability distributions on Vn,n > 1 defined by 

g(A) 

QniX) = P (A(r„+i) = A) = P {XiK^,) = A) = VA G Vn, 

satisfies the assumption (H) with 7 = 1/2, i = 1 and u proportional to the Brownian dislocation 
measure z/2. For this we recall and fix some more notations: 

* T^"^^ is the subset of T^™^ of trees with root degree less or equal to m — 2 

* T^T^ is the cardinal of T^T^ and V^^H^;) = E„>i T^T^x", = J2n>i T^T^x" 

* for A = (Ai, A2, . ..) G Vn, set Ar := Xli^s Aj = n - Ai - A2 
The main result of this section then reads 

Proposition 18. For all 2 < m < 00, and all continuous functions / : 5"'' — t- M such that 
|/(s)| <l-si /or sg5^ 

Note that the sum in the limit above is finite, since T^.™"^ < T^"^^ < Kp^ /k^/"^. This sum is 



explicit in terms of k and p when m = 2 or m = 00. See Section 2.2 for details 



With this proposition, it is easy to conclude the proof of Theorem |4] Indeed, together with 
Theorem [2] and Proposition 17 (i), it leads to the convergence 



J_T' ^ 



for the Gromov-Hausdorff-Prokhorov topology, where Cm = V 2/(-v/vrK;V'(l/p))- Then, by Propo- 
sition 17 (iii) and since (A^wj^gh) is a complete separable space, we can apply a Slutsky-type 



theorem to get 

1 ^ (d) ^ 

= ^71 r Cm / 1/2,1/2- 



n 71— ^-oo 



The rest of this section is devoted to the proof of Proposition 18 



6.2.1 Negligible terms 

We show in this section that the set of partitions A G Vn where either Ai > n(l — e) or Ar > ne 



plays a negligible role in the limit of Proposition 18 when we first let n — ?■ 00 and then e — >• 0. 
Lemma 26. There exists C G (0,oo) such that, for allO < e <1, 



e)3/2' 
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Proof. Using ( 34 ) and then ( 35 ) , we get 



XeV„,Xi>n(l-e) ^ ^ Ai = rn(l-e)l ^ ^ ii.&-a-\^ 

n 



< 



El 1 _ ^\ rp{m)^(m) 



Ai = rn(l-e)l 



^ Af'(n + l-Ai)3/2 



Ai = rn(l-e)l 

- (n(i-e))3/2 ^ ;^37^ ^ Ai-rta-^)i (i-^i/^)^^^' 



We conclude with the fact that the sum X]Ai=[n(i-e)] ~ ^i/n) ^^'^ is smaller than the integral 
Jn{i-e)^^ — x/n)^^^'^dx = 2n-y/e, and then use the lower bound of (35) for T^^^i- □ 

To deal with the partitions where Ar > ne, we need the following lemma when m = oo. We 
denote by T^°°'" ^ the number of trees of T^°°^ whose subtrees born from the root have sizes at 
most a, a > 1. 

Lemma 27. Let m = oo. There exists A, B > such that 

^(^oo,a-) ^ Q^p(^_Bk/a), VA; G N anda>l. 

Proof. Recall that T denotes the set of all (rooted, unordered) trees and rewrite the power series 
ip = as V'(x) = X^tGT According to [231 Section VII. 5] its radius of convergence is 1/p < 1 
and ip{l/ p) = 1. Note also that ^(0) = 0. Now, we consider a random tree T in T with distribution 
defined by 

P(T = t) = p-*\ 

If C0(t) denotes the degree of the root of t, we just have to show that 

P(c0(T) = r)<A' exp{-B'r), for some A', B' > and all r > 1. (36) 

Indeed, each tree with k + 1 vertices and a decomposition in subtrees with sizes at most a has a 
root degree larger or equal to k/a. So, if the above inequality holds, we will have 

^(oo^a-) ^ pk+i^^^^^T^ > ^/^^ J, g -p(oo)^ < eM-B'{k/a - 1)), 



as required. To get (36), note that 

P(c.(r) = r)= Y: P'*' = EI E E P''-^^<-'<-^-^*'^ 

teT,C0(t)=r k=l ' ti,...,tfceT mi + ...+mfc=r 

pairwisc distinct mi>l 

which is obtained by considering the multiset of r subtrees of a tree t, made of k distinct trees with 
multiplicities mi, . . . , m^. Hence, 



r ^ k 

P(c^(T)=r) < P~'Y.V^ E ]1^(P' 



kl 

k=l mi + ...+mk=r i=l 

7Tti>l 

L^tJ k t fc 

"-'Em E n«''--) + "-' E M E n*'"--) 

k=l mi + ...+mfc=r j=l k=[cr]+l mi+...+mk=r i=l 

rrii > 1 mi > 1 
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where the c g]0, 1[ chosen for this spht will be specified below. 

We first bound from above the second term. Using that 'tp{p~"^') < il){p~^) = 1 for mj > 1 and 
that Emi+...+m,=r,m,>i = (fclD ' ^6 obtain 

fc=[crj+l mi+...+mk=r i=l ^ ^ k=l ^ / L J 

mi>l 

which decays exponentially fast as r — )• oo, for every c g]0, 1[. 

Now we wiU check that the sum Ylh^i ¥.^mi+...+mk=r,mi>iIli=i'^(.P~"^') ^^^o decays expo- 
nentially in r, provided that c €]0, 1[ is chosen sufficiently small. Since V'(O) = 0, we have that 
ip{x) < Cx for some C < oo and all x G [0,p~"'^]. Hence 

LcrJ k lcr\ f, k 

Eh E n "■("-"■•) £ E^ E n^-- 

k=l mi+...+mk=r i=l k=l mi + ...+mfc=r j=l 

mi>l mi>l 

[crj 



< explCK-- (''■-l')exp(At-A(r-LcTj)) 

for all A>0 _ , , V ^ / 



k=r— [crJ 

< exp(C) (p^i exp(-A(l - c))(exp(A) + 1))" . 

When c — )• 0, exp(— A(l — c))(exp(A) + !)—)• p~^{l + exp(— A)), which is strictly smaller than 
1 for A large enough. Hence, fix such a large A and then take c > sufficiently small so that 
p~^ exp(— A(l — c))(exp(A) + 1) < 1. This ends the proof. □ 



Lemma 28. For all e > 0, 

Proof. • If m = 2, Af = for all A G Vn and the assertion is obvious. 

• Assume now that 3 < m < oo and note that when A G Vn with p(A) < m, one has that Ar > ne 
implies (m — 2)A3 > ne, in particular Ai > A2 > ne/{m — 2). Hence, 

n-2 ln-Xr-ne/{m-2)\ + 

El Q^-^) <r \ " \^ rp(m)rp(m) ™(m) 

-l{A,.>ne}S„+l ^ 2^ 2^ J-Ai J- n-A,-Ai A^+l " 

XeVn \r=\n£] Ai = [ne/(m-2]) 



Then for C a generic constant, using (35), the latter term multiplied by ^/n/T^^-^ is bounded from 
above by 

n-2 ln-Xi-ne/{m-2)\+ ^ 

C7n^/2(n + 1)3/2 ^ ^ 



X:^ne^ A, = rnt7r..-2)1 (n - A^ - Ai)3/2 ( A, + 1)3/2 



< ^ n2(n - 2)(Ln - ng/(m - 2)J+) 

— j^3*3/2 \ 
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• Next we turn to the case where m = oo. Let 7 G (5/6, 1). On the one hand, by the same token 
as for the m < 00 cases, 



n 



-,(00) 



El 



1 q(^) 

{Xr>ne} ^{X2>n-'}^n+l 



n-2 [n-X.-n-yl- 



< 



E E 



1 



-n+1 XeVn 



Xr=\ne] Xi = \ni] 



Ai/^(n-Ar-Ai)3/2(Ar + 1)3/2 



< C- 



n 



37+3/2 



n- 



0(n5/2-37^ =0(1), 



since 5/2 — 37 < when 7 > 5/6. On the other hand, we use Lemma 27 to get 



n 



l{A,>ne}l{A2<n7}S£^l < 



n— 2 n— Ar — 1 

^ ^ ^ ^ ~(oo)™-,{oo) rp(oo,nT-) 



-n+1 AeP„ 



^(00) 



E\ ^ r|-i(00)r|-,(00) rp(oO,n^ 
/ . -^Ai -^n-A,.-Ai -^Ar+1 



-^n+1 Ar=rne] Ai=l 

< Cn^ exp(-Bn^^^e) = o(l) . 



□ 



6.2.2 Proof of Proposition 18 



We rely on the following lemma. Let be the subset of Vn of partitions of n with exactly two 
parts. 

Lemma 29. Let / : 5^ — )• M be continuous. 
(i) For all a G Z+ and all e G (0, 1), as n ^ 00, 



n 



E / 



-,(m) 
-n+1 AP'P'''" 

Ai<n(l-e) 



Ai A2 + g ^ ^ qW > /"^ /(a:^, 1 - a;, 0, 

n ' n ' 



0' • • • ^n+l-a 



P^^" Jl/2 x3/2(l-x)3/2 



-dx. 



(ii) Moreover, there exists G (0,oo) siic/i i/iai, for all n > 1, all < a < ne/2 and all non- 
increasing non-negative sequences {ai,i > 1) with ^j>i < a/n, 



Ai<n(l-e) 

Proof, (i) For large enough n 
'Ai A2 + a 



"i^ /(—, — + 01,02,03, •••) S 

■n+1 Ae'P'''" 



(A) rp(m) 

n+1— a a+1 



< 



(a + 1)3/2- 



n— a 

Ai<n(l-e) 



n n 



0' • • ^i+i-a - / ^2 ~ 2n' 2 2n'^' ' ' ■ ^ 



Ln(l-£)J 



+ 



E /(^,i-^,o,..^)t<7M-U^ 



Ai = [(n-a)/2j+l 

On the one hand, by Otter's approximation result for T|^2a)/2 

F(n-a)/2(2) = T|:i)/,(T[:1^)/, + l)/2 ~ n'p-^iin - a)/2)-^/2 = o(Ti-) /V^). 
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On the other hand, still using Otter's result, we get that for all rj > 0, provided that n is large 
enough, 



n 



-,{m) 



[nils)] 

E 



-ra+1 Ai = [(n-a)/2j + l 

2 , ln{l-e)\ 



n 



Ai 



n 



,0, 



rp(m)rp(m) 

Ai n—a—\i 



< 



(k — r])p^^°- n 



E / 

Ai = [(n-a)/2j + l 

f{x,l-x,0, 



\ n 



Ai 



n 



0, . 



(n + 1)3/2 



n 



3/2 



,3/2 



(n-a- Ai)3/2 



n->oo (k — r/)p^+" 7x/2 x3/2(l — x)3/2 

Letting 77 —5- 0, this gives 



-dx. 



lim sup 



n 



(m) 



Ln(l-e)J 



/ 



Ai 



n 



1 



Ai 



n 



,0,... T 



-i(m)rp(m) 

'Ai n— a— A] 



< 



/(x,l -x,0. 



-dx. 



-n+l Ai = [(n-a)/2j + l 

We obtain the liminf similarly, hence (i). 

(ii) We will use that S^^l^__^ < t'--^^t'-^\_^^ for all A G V^^^. Recah that / is bounded on S^. 
There exits then a generic constant C independent of n and a < ne/2 such that 



n 



E 



^(m) 

■"+1 Aepbi^^,Ai<n{l-£) 
Ln(l-£)J 



Al A2 \ (A) rr(m) 



< 



< 



< 



-,(m) 



Er-p(m)r-p(m) rp(»n) 
-•-Al -"-n-a-Ai a+1 
-n+l Ai = [(n-a)/2] 

Ln(l-e)J 



1 



n 



3/2 



C 1 

(a + 1)3/2 n 



3/2 
1 

n3/2 



(n - a - Ai)3/2 
n3/2 



[n{l-e)\ 



where we have used for the last inequality that n — a > n(l — e/2) since a < ne/2 and that 
n — a — Xi > (n — Ai)/2 since a < ne/2 and Ai < n(l — e). This upper bound is of the form 
Cun/{a + 1)3/2 -^^jjgpg (un,n > 1) is a sequence independent of a converging to a finite limit as 
n — )• oo. Hence the result. □ 



Proof of Proposition 18 By Lemmas 26 and 28, the set of partitions where either Ai > n(l — e) 
or Ar > ne/3 will play a negligible role in the limit when we first let n — t- oo and then e — )■ 0. 
Hence we concentrate first on the following sums (for e G (0, 1)), where we use that for all A G Vn, 
Al < n{l — e) and A^ < ne/3 implies A2 > A3: 



E 



AeP„ 

Al <?i(l— f:),Ar<ne/3 



Lne/3J 



) s;fi = E E E / 



Al \2 + k 



n 



n 



,0, 



Lne/3J 

+ E E E 

fc=o ^ieVk agp'^^j. 



Al A2 /ii 

; 5 1 ' 

nun 



/('^,^,o,.. 

n n 



{j(A) (o'j\ 
■^n-fc+l^fc+l \'^') 



j{A) 



^n-fe+l'-'fc+l' 
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The first sum in the riglit-hand side of (37) is equal to 



E E / 

fc=0 A6Pb!l\,Ai<n(l-e) 



Ai X2 + k 



n n 



0,... s- 



;(A) 



p(m) 

-fc+1 



(38) 



which, multiphed by ^/n/T^^-^^, according to Lemma 29 (i) and (ii) ((ii) imphes dominated conver- 
gence), converges to 



oo /-l-e 



/(x, 1 -x,0, , 



3/2(1 _ ^)3/2 



-dx. 



(39) 



Next, let 5 > 0. Since / is continuous (hence uniformly continuous) on the compact set S^, we can 
choose e small enough so that the absolute value of the second sum in the right-hand side of (37) 
is bounded from above by 



2E E 

k=0 \<^Vl%M<n{l-e) 



(40) 



Similarly as above, when multiplied by y^/T^'^^-^, this quantity converges to 



oo „ 

^Z^ 1+fe / 

fc=0 ' 



1-e 



(5 A (1 - x) 

/2 X3/2(1 - 3;)3/2 



V - ~ (m) /" 5 A ( 



^ 5A(l-x) 

x)3/2 



dx 



(41) 



by Lemma 29 (i) and (ii). 



Now let ?7 > be fixed. For 6 and e sufficiently small, the terms (41 ) and the limsup of Lemma 



26 are smaller than rj, and the term (39) is in a neighborhood of radius r] of the intended limit 



oo rji(m) » 

k=0 ^ 



^1 /(x,l-x,0,...) 

12 x3/2(l-x)3/2 



dx. 



(42) 



Next, such small 6 and e being fixed, letting n — )■ oo, and using Lemma 28 and the convergences of 
(38) to (39) and of (40) to (41 ), we get that V^EasPu f in) si+i/T^+i is indeed in a neighborhood 
oFradius 7r/ of (42) for all n large enough. □ 
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